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Abstract 

We consider the problem of minimizing the number of broadcasts for collecting all sensor mea¬ 
surements at a sink node in a noisy broadcast sensor network. Focusing first on arbitrary network 
topologies, we provide (i) fundamental limits on the required number of broadcasts of data gathering, 
and (ii) a general in-network computing strategy to achieve an upper bound within factor log N of the 
fundamental limits, where N is the number of agents in the network. Next, focusing on two example 
networks, namely, arbitrary geometric networks and random Erdos-Renyi networks, we provide improved 
in-network computing schemes that are optimal in that they attain the fundamental limits, i.e., the lower 
and upper bounds are tight in order sense. Our main techniques are three distributed encoding techniques, 
called graph codes, which are designed respectively for the above-mentioned three scenarios. Our work 
thus extends and unifies previous works such as those of Gallager HI and Karamchandani et. al. Ill on 
number of broadcasts for distributed function computation in special network topologies, while bringing 
in novel techniques, e.g., from error-control coding and noisy circuits, for both upper and lower bounds. 

Index terms: graph codes, noisy networks, distributed eneoding, sealing bounds. 

I. Introduction 

A. Motivations and Main Contributions 

Distributed data eolleeting in a multi-agent sensor network [HI is crueial in many applieations 
of data proeessing and network eontrol. We foeus on problems where there is one sink node in 
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the network that needs to eollect all sensor measurements for further funetion computation tasks, 
e.g., SUM, MAX, Majority, Parity, Histogram, etc. Although distributed data processing in sensor 
networks is often studied from the perspective of distributed in-network function computation [13]|, 
Il4l . our focus here is on the computation of the most communication-intensive function: the 
identity function (see, e.g. ffl, O, ifSll- lfT^ l. where the goal is to collect all the measurements 
themselves at the sink nodqj. This problem is of practical importance: as discussed in 0, “data 
gathering remains the primary service provided by wireless sensor networks”. Moreover, when 
the specific processing task of sensor measurements cannot be foreseen, collecting all sensor 
measurements is the safest strategy. Data gathering is also necessary in monitoring each agent in 
an emergency response system, for instance, the wearable wireless sensors that are connected with 
device-to-device links provide real-time monitoring signals for smart health care. An interesting 
application is the optimization of the waste collector truck route based on the load levels of waste 
containers in a smart city, where binary bits that indicate whether load levels exceed thresholds 
are reported by a large number of wireless sensors ifTTI to a remote data center. 

In the above-mentioned applications, data are often generated in sensors in the form of short 
and instant messages, and the number of sensors can be quite large. In this circumstance, 
communication throughput might not be the ultimate goal, since data are instant, instead of 
generated in streams. Following the seminal work of Gallager [HI, we consider communication 
complexity ifT^ . measured in number of broadcasts in bits, as the optimization goal. We assume, 
in each time slot, a network agent broadcasts a message bit to its neighborhood, and each 


other agent in this neighborhood receives an independent noisy cop>@ of the broadcast message. 
Without loss of generality, we assume that each network agent has only one bit of information 
and the sink node needs to collect all these bits with some required accuracy and minimum 
number of broadcasts. The network consists of {N + 1) agents (also referred to as nodes in the 
sequel), among which one agent is assigned as the sink (arbitrarily but decided apriori). These 
-I- 1 agents may directly communicate with subsets of other agents through unidirectional or 
bidirectional noisy links as determined by a preassigned (but arbitrary, possibly sparse) inter-agent 


*Any other function computation will need only fewer number of transmissions because if the sink node can reliably compute 
the identity function, it can also compute any other function reliably. 

^The assumption on noisy networks is suitable to model wireless sensor networks with limited transmission power and 
decoding capabilities. 
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communication network. We model noisy links as binary symmetric channels (BSC) or binary 
erasure ehannels (BEC). Note that the results on eommunieation eomplexity, whieh is the foeus 
of this paper, are often obtained under speeifie assumptions on the network strueture, ineluding 
eomplete networks [m, lISlI, IfTSl - lfTSl . grid networks ^ and random geometrie networks ll^, 1171, 
[fT^ - lfT9l . However, we seek to obtain results that are independent of the network topology. In 
other words, our goal is to characterize the eommunieation complexity scaling in networks with 
arbitrary topologies. A similar problem is also eonsidered in [[8l|, Il20l - ll22l . but the problem 
of data gathering in a noisy network is not eonsidered. Therefore, we believe that this work 
is the first to consider the minimum broadcasting complexity problem for data eollecting in a 
distributed network with noisy links and arbitrary topologies. Interestingly, the eommunieation 
eomplexity results in this paper eoineide with many existing results obtained under speeifie 
graph topology. The comparison between our work and related works is discussed in detail in 
Seetion II-Bl 

There are three major eomputation models in the field of in-network eomputing: one-shot 
eomputation |[I1, |151, O, [fT3]l . Ifldll . ifT^ . lfT9]l . bloek computation Q, ll^ and pipelined 

eomputation dUl, ifTOll . [l20l . If2^ . If2^ . We eonsider the one-time eomputation model, whieh 
means a one-time gathering of all the data, beeause eaeh node only has a short message, e.g., one 
bit of information, to be sent as a separate data packet. This kind of eommunieation problems with 
limited data is frequent |[T1 in distributed control of networks or a distributed monitoring system, 
where eaeh sensor is required to report just a few bits to deseribe the state of the eorresponding 
subsystem in a timely manner. Under the assumption of instant message eollecting, applying 
classic error control coding to cope with noisy links is highly non-trivial, since it is impossible 
for eaeh node to gather enough data to be eneoded into bloeks before being transmitted and 
distributed eneoding is necessary^ This is also one of the main reasons why we explicitly 
consider noisy ehannels, rather than eonsidering noiseless or effeetively noiseless ehannels (on 
whieh noise-free eommunieations ean be aehieved as long as the eommunieation rate is below 
the ehannel capaeity), as the elassical notion of channel eapacity is not generally applieable 
in seenarios involving instantaneous and distributed eneoding. Rather, an effeetive eomputation 


^In fact, we use linear block codes with distributed encoding techniques in the paper. However, the encoding is instant in 
contrast to classical coding theoretic frameworks which operate on (large) blocks of data. 
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(encoding) scheme in our context involves carefully designed in-network computations and inter¬ 
agent message exchanges (through neighborhood broadcasts). 

In dill, Gallager considers the data gathering problem in a complete graph and obtains an upper 
bound C>(A^loglogiV) on the communication complexity. Here, we address the same problem 
in general graphs (possibly very sparse) and obtain a general upper bound. Specifically, we show 
that this upper bound reduces to C>(A^loglogiV) as long as the network diameter stays bounded 
as —)■ oo. The main technique that leads to the generalization of Gallager’s result to arbitrary 
graph topologies is a distributed encoding scheme, called graph code, that extends error control 
coding to distributed in-network computations. The graph codes constructed in this paper are 
conceptually different from the encoding scheme developed by Gallager for complete networks. 
We first consider a general network and design a general graph code for it. Then, we modify 
this code to improve its performance in more specific graph topologies. The formal definitions 
of graph codes will be given in Section |IVl In the following, we briefly discuss the three graph 
codes that are used in this paper. 

1) QC-1 Graph Code in General Graphs: In Section |Vl general graph topologies are consid¬ 
ered and the QC-l graph code is provided. It is shown that in both BSC and BEG networks, the 
number of broadcasts required by the QC-l graph code is max{Q{dgN),Q{NlogN)}, where 
dg denotes the average distance from all agents to the sink. We also obtain a max [Q{dgN), 
©(iVloglogiV)} lower bound on the communication complexity through cut-set techniques in 
BSC networks, and a Q{dgN) lower bound in BEG networks using the same techniques. Note 
that there is a non-negligible gap between the above mentioned upper bound and the lower bound. 
When dg > ©(log A^), the upper bound coincides with the lower bound. When dg is small, there 
is at most a log N multiple between the two bounds. We also show a max{Q{dgN), Q(N log A^)} 
lower bound in constant-degree networks with BEG channels, which implies that the QC-l graph 
code also achieves optimality in this scenario. To provide better intuition, we explain through 
examples how this gap gets introduced. 

Compared with Gallager’s result [[I]|, ©(A^loglog A^) complexity in a complete graph, dgN 
characterizes the cost due to possibly large graph diameter. Therefore, in general networks, we 
may need strictly more communications than the complete graph, so the gap between Gallager’s 
and ours is not because our scheme is suboptimal. In fact, in Gallager’s setup, dg = 1. 
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2) QC-2 Graph Code in Geometric Graphs: Motivated by the above mismatch between the 

achievable result and the converse result, we reconsider the data gathering problem in general 
geometric graphs with BSC or BEC links (in contrast to random geometric graphs that have 
random node placement) in SectionFor these graphs, we design a new in-network distributed 
encoding scheme, referred to as the QC-2 code. In a geometric graph, all nodes are placed within 
a 1-by-l square, and each node is able to broadcast within a certain distance r < 1. We prove 
that, when r is larger than a threshold with order the communication complexity 

upper bound achieved by the QC-2 scheme is max{Q(dgN), ©(iVloglog A^)}, which coincides 
with the general lower bound in Section |Vl 

The QC-2 code utilizes this fact and each code bit calculated at a node v is the parity of the 
bits held by a subset (possibly strict) of its neighbors in a local complete graph. Therefore, the 
QC-2 utilizes local broadcasting among neighboring nodes as a means of reducing the distributed 
encoding cost. Interestingly, we show that the QC-2 code essentially reduces to the coding scheme 
in jSl Section 7] on complete graphs (see Remark H]). However, in contrast to the coding scheme 
developed in |151 Section 7] which applies to complete graphs only, the QC-2 code is applicable 
to a much broader class of graphs (arbitrary connected geometric graphs) and achieves function 
computation using the same number (in the order sense) of broadcasts. 

3) QC-3 Graph Code in Extended Erdds-Renyi Graphs: In Section IVIIl we investigate the 
same problem in a low-diameter graph, because we can see from the previous discussion that 
large graph diameters lead to the dgN gap with respect to the ©(iVloglog A^) bound in [HI. Our 
motivation is to determine instances of non-complete graphs where it is possible to achieve the 
bounds in [HI for complete graphs. We find that an Erdos-Renyi random graph [[25l suffices if 
two further assumptions are made: 

• More links are added to the Erdos-Renyi graph such that the multi-hop distance from each 
agent to the sink is bounded (e.g., when the sink is a central node and all other nodes have 
an extra directed link to it); 

• The noisy links are BEC instead of BSC. 

We call it the extended Erdos-Renyi graph. The technique utilized in the extended Erdos-Renyi 
graph is referred to as the QC-h code. Using the QC-h code, we show that the ©(A^ log log A^) 
upper bound can be achieved without the complete graph assumption. The applicability of the 
QC-3 code is however not limited to the extended Erdds-Renyi graphs. The QC-3 code may 
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outperform other types of graph codes in terms of error decay exponents in certain scenarios 
beyond the extended Erdos-Renyi case. For instance, we show that in a complete network with 
EEC channels (where both QC-2, QC-h and the scheme in |l5l Section 7] are applicable), the error 
exponent achieved by the QC-3 code is better than that of the QC-2 code (see Remark|7]) which, in 
turn, is a generalization of the scheme in dH Section 7] to general geometric graphs. This implies 
that although the QC-2, QC-3 and the scheme in (Si Section 7] all achieve a C>(A^loglog A^) 
complexity, QC-3 outperforms the others in terms of error probability decay rates. 

The analysis of the error probability of QC-3 code leads to, as by-products, new fundamental 
results in the design of erasure codes for point-to-point communications. In particular, we use 
the analyses for the QC-3 code to show that there exist sparse erasure codes that can achieve 
diminishing error probability decaying polynomially with the code length. 

We also borrow cut-set techniques of noisy circuits ll26ll . lETI . to derive a lower bound on 
the number of edges in the Erdos-Renyi graph, in order to determine the required sparseness 
of the underlying graph for reliable data collection. As by products, we show similar analysis 
techniques can be used to conclude that the number of ones in the generator matrix of an erasure 
code should be at least il{N log N) in order to achieve decaying block error probability. Note that 
il{N log N) is in the same scale as LT codes (Euby transform codes) lf28]l . In all, the QC-3 code 
has strong a relevance to erasure codes, and techniques in the in-network computing problem 
can be applied to the analysis of erasure codes for the classical point-to-point communication 
setup. 

The above mentioned three types of codes have the relationship QC-3cQC-2cQC-l according 
to the construction of each code bit. However, these codes have the encoding complexity 
relationship QC-3<QC-2<QC-l. Therefore, the high-index codes are simple but meant for specific 
graph scenarios, while the low-index codes are complicated but suitable for general topologies. 
By studying different graph codes, our goal is to theoretically understand in-network computing 
and data aggregation under the assumptions of link noise and distributed data, with the aim of 
minimizing the number of communications. Some of the major attributes of the three different 
types of graph codes are presented below. 
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Applicable Networks 

Analyzable in 

Number of Broadcasts 

gc -1 

Arbitrary connected networks 

BSC (Section V) 

BEC (Section V-D) 

max{Q{dgN),Q{N\ogN)} 

QC-2 

Arbitrary connected geometric networks 

BSC (Section VI) 

BEC (Section VI-C) 

max{0(JpA), 0(Vlog log A)} 

gc-3 

Extended geometric networks 

BEC (Section VII) 

0(Alog log A) 


B. Related Works 

As mentioned earlier, although our problem of minimizing the broadeast complexity in an 
arbitrary topology is new, the scaling bounds obtained in this paper coincide with many existing 
ones under the assumptions of specific graph topologies. In what follows, we show how our 
results are related to existing results in the literature of in-network computing. 

This work was initially inspired by the seminal work of Gallager |[T1, where the minimum 
broadcast complexity problem in a noisy complete network is examined. If the naive approach of 
repetitive coding, which neglects the broadcast nature of the receptions, is used, the number of 
transmissions scales as 0{N log N). However, in |[T1 a delicate broadcasting scheme is designed 
to achieve a complexity of ©(iVloglog A^) for the parity calculation problem and the identity 
calculation problem, i.e., data gathering. In [|5l, this bound is proved tight for the identity 
calculation problem. For general graphs, Gallager’s scheme is however, no longer applicable 
as it relies heavily on the complete graph structure. Nevertheless, this ©(iVloglog A^) bound 
still meets the upper bound obtained in Section and Section IVTII when dg = 0(1). The proof 
technique for the QC-3 code in Section IVIII is based on rank analysis of random matrices, and 
can only be applied to a BEG, which is different from Gallager’s original setting. Nonetheless, 
even if Gallager’s algorithm is applied to a BEG in the complete network setting, the achieved 
order continues to be ©(A^loglog A^). Moreover, the GC-3 scheme is applicable for Gallager’s 
complete setting with BEG links, but ends up using much fewer links in general. Therefore, 
our result in an extended Erdos-Renyi random graph can be viewed as a generalization of prior 
results under weaker topology assumptions. 

In [l2l, data gathering in a grid network is studied. Theorem IV. I and Theorem IV.2 in [|2ll state 
that, in an y/N x \/N grid broadcast network with a transmission radius r, the communication 
complexity for identity function computation is max {©(A^^^^/r), ©(A^ log log A^)}, which 
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matches with the lower bounds in Seetion |V] and the upper bound in Seetion |Vll In fact, the 
diameter for this network is in the order of jr. 

In [[^, the same problem in a random geometrie graph is examined. The proposition 2 
of im elaims that the eommunieation eomplexity is upper bounded by 0{N under the 
assumption that the diameter of the network is Therefore, their upper bound also 

matehes our general upper bound for arbitrary geometrie graphs. Although the eommunieation 
protoeol in [[6l| also has a sink-tree-based multi-hop relay proeedure similar to ours, their protoeol 
requires nodes to be evenly distributed in the graph. In faet, many works on network-eomputing 
in random geometrie graphs [0, [|71|, lfT^ - [[T^ . rely highly on the result that the graph 
ean be elustered in an even manner into groups with 0(logA^) nodes. Then, codes with length 
0(logA^) are repeatedly used to faeilitate hop-by-hop transmissions. This teehnique ean also 
be viewed as distributedly eneoding eodes with low-density generator matriees whieh have a 
structure as shown in lf29l See.IV]. However, in praetieal applieations of network computation, 
the elaim that nodes are evenly distributed might not hold OOl . OTI . 

From the perspeetive of eoding theory, the proposed ^C-3 eode is elosely related to erasure 
eodes that have low-density generator matriees (LDGM) G9l . G2l . In faet, the graph 
eode in this paper is equivalent to an LDGM erasure eode with noisy eneoding eireuitry G4ll . 
where the eneoding noise is introdueed by distributed eneoding in the noisy inter-agent eommu¬ 
nieation graph. Based on this observation, we show (in Corollary that our result directly leads 
to a known result in eapaeity-aehieving LDGM eodes. Similar results have been reported by 
G^ and for eommunieation and by Gill and G9l for distributed storage, both with noise- 
free eneoding. Due to eneoding noise, their analysis tools are not applieable for our problem. 
Moreover, our graph eode aehieves polynomially deeaying error probability with inereasing eode 
length (whieh is 2N, twiee the number of agents in the network), using only binary bits, rather 
than polynomially deeaying error with expanding Galois field dimension. We also obtain a lower 
bound on the number of ones in an erasure eode generator matrix with teehniques inspired by 
the analysis of graph eodes. Our work is also deeply related to network error eontrol eoding 
G5ll - G9l . but our work emphasizes more on the perspective of distributed eneoding in noisy 
networks. 
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II. Notation and Preliminary Results 


The calligraphic letter Q = (y,S) represents a graph with a node (vertex) set V and an edge set 
£. In this paper, an edge is directed unless otherwise stated. Each graph with N vertices has an 
7V-by-iV adjacency matrix A = (Am,n) = (^i, a 2 ,..., a^v), which represents the edges or network 
connections, i.e., Ajn,n = 1 if the node Vm has a directed edge to the node Vn, or equivalently, 
{vm, Vn) E 8. Denote the one-hop out-neighbors of a node v by A/"+ := {te G V| {v, w) E 8, w ^ 
t>}. Denote the one-hop in-neighbors of a node v by := {te G V|(w,l) E 8, w ^ v}. Note 
that the node Vm E J\f~{vn), if and only if = 1- If ^m,n = = 1, we say that and 

Vn are linked bidirectionally. In an undirected graph, i.e., in which all edges are bidirectional, 
Vm E M~{vn) is equivalent to Vm E AA+(n„). Thus, when the graph is undirected, we write 
N'{v) for simplicity. 

We will obtain scaling bounds on the communication complexity of discrete-time algorithms. 
Time is assumed to be discrete or slotted throughout the paper. The symbol t denotes time. The or¬ 
der notations fi{N) = 0{f2{N)) and fi{N) = f2(/2(iV)) respectively mean that fi{N)/f 2 {N) < 
Cl and fi{N)/f 2 {N) > C 2 for two positive constants Ci, C 2 and sufficiently large N. By 
fi{N) = Q{f 2 {N)) we mean that fi{N) = 0{f2{N)) and fi{N) = VL{f 2 {N)). 

By F 2 , we denote the binary field {0,1}. We will use basic results from error control coding, 
in particular, properties of binary linear block codes. A binary linear block code ll40ll with code 
length N and rate i? < 1 is a set of 2^^ binary vectors (codewords) that form a linear subspace 
C C F^. We always assume that NR is an integer. Each codeword c G C can be written as the 
product of a binary row vector m with length NR, called the message vector, and an NR x N 
binary matrix G, called the generator matrix. If G = [I, A], where I denotes the NR x NR 
identity matrix, we say that the code with the generator matrix G is systematic. 

A binary symmetric channel (BSC) with crossover probability e is a channel that flips a bit 
with probability e. A binary erasure channel (BEC) with erasure probability e is a channel that 
outputs an erasure value ‘e’ with probability e, no matter what value the input takes. Einally, we 
state two useful results from the theory of reliable communication HOll . The first one concerns 
repetition codes and the second one linear block codes for reliable message transmission over 
noisy communication channels. 

Lemma 1. ( [|40l Section 5.3]) Suppose we have a BSC with crossover probability e. If one 
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bit a; G F 2 is repeatedly transmitted through the ehannel for j times and the reeeiver uses the 
majority rule to make a deeision x the value of x, then, the bit error probability is upper bounded 
by 

pibit) ^ j^x)< [4e(l - e)]^/2. (1) 

Remark 1. Lemma [U states that ) repeated transmissions are suffieient to aehieve an 

error toleranee probability Pe at the destination, when the point-to-point souree to destination 
ehannel is a BSC. One might eonsider using adaptive sehemes, sueh as sequential deteetion iHTlI . 
to reduee the number of repetitions to aehieve the same level of Pe- However, this does not ehange 
the number of transmissions in order sense. 

Binary linear bloek eodes ean be used to transmit binary veetors over noisy ehannels. Suppose 
we have a iC-bit message veetor m and a eode C with length N and R = ^. Then, we ean 
eneode the message m into N bits by multiplying m with the generator matrix G, transmit these 
N bits over a ehannel and deeode the reeeived bits. The bloek error probability is defined as 
the probability that the deeoding result m is different from the original iC-bit message at least 
in one bit. The next lemma eharaeterizes the performanee of using binary linear eodes over a 
BSC. 

Lemma 2. ( HOl Theorem 5.6.2])(Random Coding Theorem) Suppose we have a iC-bit message 
veetor m to be transmitted through a BSC with erossover probability e. Then, for eaeh R < C, 
where C is the ehannel eapaeity, there exists a binary linear eode with length Nji and rate R, 
sueh that K < NrR and the iC-bit message ean be eneoded into Nr bits, transmitted through 
the BSC and deeoded with bloek error probability upper bounded by 

pibik) ^ ^ m) < exp[-KEr{e, R)/R], (2) 

where Er{e, P) > 0 is the random eoding exponent. 

The random eoding error exponent Er{e,R) for a BSC with erossover probability e ean be 
written as 

Erie, R) = max \—pR + Enip, e)l, 

0<p<l 

where 

Eoip,e) = pln2- (1 + p) In + (1 - . 

The random eoding error exponent Erie, R) is always positive for eoding rate R < C = 1—H (e). 
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III. System Model and Problem Formulations 
A. Data Gathering with Broadcasting 

Consider a network Q = {V,8) with iV + 1 agents V = {fn}n=o> where Vq is a preassigned 
sink node. Eaeh agent Vn with 1 < n < iV has one bit of information Xn G {0,1} distributed as 
Bemoulli(|). This is ealled the self-information bit. All self-information bits are independent of 
eaeh other. Denote the veetor of all self-information bits by x = {xi,X 2 ,..., xjv)^. The objeetive 
is to eollect x, in the sink vq with high accuracy. 

Time is slotted. In the f-th slot, only one chosen node v{t) is allowed to broadcasj^ one bit of 
information in F 2 to its out-neighborhood The channel between any two connected 

nodes is assumed noisy. Since we consider different noise models, we make two assumptions 
for convenience of reference. 

(A.la) BSC: All channels or graph edges are BSCs with identical crossover probability e G 
(0,1/2). All channels are independent of each other. 

(A.lb) BEC: All channels of graph edges are BECs with identical erasure probability e. All 
channels are independent of each other. 

^(N) 

A broadcast scheme ^ = {ft}t=i is a sequence of Boolean functions, such that at each time 
slot t the broadcasting node v{t) computes the function ft (whose arguments are to be made 
precise below) and broadcasts the computed output bit to its out-neighborhood. The parameter 
is used to denote the total number of broadcasts in a broadcasting scheme y which, in our 
setup, also corresponds to the time complexity or implementation time of because in each 
time slot, only one node is allowed to broadcast. The minimum value of ^ among all broadcast 
schemes is defined as the communication complexity of the data gathering problem, which is 
denoted as The arguments of ft may consist of all the information that the broadcasting 
node v{t) has up to time t, including its self-information bit Xy(t), randomly generated bits and 
information obtained from its in-neighborhood called the outer information. We only consider 
oblivious transmission schemes, i.e., the number of broadcasts \ all functions in and the 
broadcasting order {v{t)}tfi are predetermined. It also means that transmission by silence is 


^The transmission scheduling is beyond the scope of this paper. This paper address a fundamental issue, the communication 
complexity, which is minimum over all scheduling protocols in place. Nevertheless, transmission scheduling indeed improves 
the network throughput EH. 
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not allowed, i.e., a node has to broadcast when it is required. Further, we assume that a scheme 
terminates in finite time, i.e., < oo for all N. A scheme obviously has to be feasible, 

meaning that all arguments of ft should be available in v{t) before time t. Denote by T the set 
of all feasible oblivious schemes. The final error probability is defined as = Pr(x 7 ^ x), 
where x denotes the final estimate of x at the sink vq. Usually it is required that the error 
probability is asymptotically bounded or lim Pi^^ < ptai- where ptai niight be zero, which 

N^oo 

means that the error probability should be small even if the number of vertices in the network 
is large. Although our objective does not involve convergence rate requirements, in this paper, 
convergence rates are indeed given for all constructive results. The problem to be studied is 
therefore 


min 

s.t. lim Pj^^ < Ptar- 

N^oo 


(3) 


We call this problem the noisy broadcasting problem. In this paper, we will consider both fixed 
graph topologies and random graph topologies, which will be clear in the next subsection. The 
above mentioned error probability Pi^^ needs to be interpreted in the expected sense when 
dealing with random graph topologies. Specifically, for random topologies, denote by Pf the 
(conditional) error probability conditioned on an instance Q of the communication graph. If 
the graph Q involved is deterministic, Pf = otherwise, for random graph topologies, the 
conditional error probability is itself a random variable and the error probability metric Pi^^ 
is defined as the expected error probability Pj^^ = Eg [Pf] (When dealing with random graphs, 
the quantities IPg(-) and Eg[-] denote probability and expectation with respect to the distribution 
of the random graph ensemble.). The transmission scheme design problem for random graphs 
is the same as in ([3]). 


B. Network Models 

When working with deterministic (but arbitrary) graph topologies, we assume that the network 
is connected. Specifically, we impose the following connectivity assumption. 

(A.2) Network Connectivity: In the directed graph Q = {V,8), the sink node vq is reachable 
from each non-sink node w G V \ {no} through a sequence v ^ —)■ •••—>■ no of directed 

edges. 
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In (A.2), recall that if the graph is undirected, each (bidirectional) edge corresponds to the 
two directed edges. Since network connectivity is necessary for data collection, we assume this 
assumption holds throughout. 

We use T = (V, S-j-) to represent the breadth-first search (BPS) spanning tree [|4^ of ^ = 
{V,8) rooted at the sink vq. The edge set Ej- is a subset of E and \Er\ = |V| — 1. A BPS tree 
can be constructed as follows: 

• Initialize: Vr = Er = 0. 

• Pind all directed edges (u, v) e E such that u ^Vr and v E Vr- Include (u, v) in Er and 
include u in Vr- 

• Repeat the previous step until Vr = V. 

By assumption (A.2), the BPS tree exists. By d{v,vo), we denote the multi-hop distance from 
a node v to the sink vq. An obvious property of the breadth-first search spanning tree T is that 
the multi-hop distance d{v,Vo) is the same in T as in the original graph Q. By the (-th layer 
V; C V, we denote the set of nodes that have identical multi-hop distance d{v,vo) = 1. Denote 

Ld 

the maximum distance from a node v to the sink vq by Ld- We know that V = \jVi forms a 

1=1 

layered partition of the node set. In the BPS tree, the parent-node Vf of a node v is defined to 
be the unique node such that there exists a directed edge {v,Vf) in the BPS tree’s edge set Er- 
The descendants of a node v is defined as the set Vy <Z V that includes all nodes w that are 
connected to v through a sequence of directed edges in Er¬ 
in Section |Vl we consider the noisy broadcasting problem on a general graph. The broadcasting 
scheme in [|TI| is designed for complete graphs and not directly applicable here. Intuitively, the 
communication complexity is higher for general graphs, in contrast to complete graphs, because 
a non-negligible routing complexity might be incurred due to the (possibly) large distances of 
some non-sink nodes to the sink vq. 

In Section IVH we consider the noisy broadcasting problem in geometric graphs. By geometric 
graph, we mean each node is connected to and can only communicate with nodes that are within 
a certain (specified) distance of itself. The formal definition is given in the assumption (A.3a). 
Por comparison, we will cite a result on random geometric graphs (bl . The definition of random 
geometric graphs is given in the assumption (A.3b). Por arbitrary graphs and geometric graphs, 
we provide communication complexity results for both BECs and BSCs. 

(A.3a) Geometric Graph: The graph Q = {V,E) is assumed to be a geometric graph, i.e.. 
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all + 1 nodes in V are loeated in a 1-by-l square area, and any two nodes are eonneeted 



bidireetionally if they are within a speeified distanee r. Further, we assume that r > 


where Cg is a eonstant. Finally, we assume that Q is eonneeted, whieh means that Q satisfies the 
assumption (A.2]l 

(A.3b) Random Geometric Graph: The graph Q = {V,8) is assumed to be a geometrie 
graph whieh satisfies the assumption (A.3a^^. Moreover, eaeh node in V is distributed uniformly 
in the 1-by-l square area, independently of other nodes. 

In Seetion IVIIl we eonsider the noisy broadeasting problem in the extended Erdos-Renyi 
network, whieh is slightly different from the original Erdos-Renyi model in |[25ll . The definition 
is given in the assumption (A.4). In this model, the eonneetion probability p = indieates 

that the average node degree is 0(logA^). We will also show that the minimum average node 
degree is at least if the error probability of data gathering is required to approaeh 

zero when the node number approaehes infinity. This result states that p = is minimum 

in the order sense exeept for a log log N faetor. A sink might be a base station and all agents 
have direet links to iR In this seetion, links are assumed to be BECs as in the assumption (A. lb). 
Eurthermore, in the extended Erdos-Renyi network, the error probability in dU) should be 
replaeed by EdPe^^), where the expeetation is taken over all random graph instanees. See 
Seetion IVIIl for more details. 

(A.4) Extended Erdos-Renyi Graph: The extended Erdos-Renyi graph is an ER graph with 
the minimal number of additional links that ensures that eaeh non-souree node has direeted link 
to the sink. In the graph Q = (V, £), all eonneetions are independent of eaeh other. Assume that 
p satisfies p = where c is a eonstant. We further assume that eaeh node in V has a direet 

link to the sink, in addition to the random eonneetions between these nodes themselves. 

Assumption (A.4) ean be interpreted as follows: the edge set £ ean be deeomposed into 

^This assumption is required because connectivity within distance r does not necessarily ensure connectivity. 

®Note that the connectivity assumption (A.2) here is still needed, although the random geometric graph is connected with 
high probability if r is large enough. 

’However, as long as these direct links are noisy, the communication complexity for the data gathering is in the order of 
0{N log N) (2 if a naive scheme is used that aims to transmit the self-bit of each node to the sink through the corresponding 
direct link. We will prove that in-network computation makes this complexity smaller by utilizing more communications 
between non-sink nodes, i.e., information fusion. Moreover, these inter-node communications are usually cheaper than direct 
communications with the base station. 
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S = U 82, where 81 is the set of direeted edges eonneeting non-sink nodes, whieh form the 
edge set of a direeted Erdos-Renyi network with eonneetion probability p = and 82 can 

be viewed as the minimum set of edges that is further added to the graph with edge set 81 so 
that each non-sink node has a directed link to the sinlo To be precise, in the standard Erdos- 
Renyi generation, there might already be some edges between sink and non-sink nodes. The 
set 82 is the additional set of source to non-source links not obtained through the Erdos-Renyi 
instantiation. 


IV. Main Techniques: Graph Codes 

Graph codes are distributed linear block codes which have generator matrices closely related 
to the network structure. Designing graph codes relies on the utilization of the network structure. 
We introduce three different types of graph codes in the following. The following descriptions 
are informal, details and their usage will be made clear in the subsequent sections. 

A QC-3 graph code is a rate-| systematic code with a generator matrix G = [I, A] with A 
being the graph adjacency matrix. The encoding of a QC-3 graph code can be written as 

rT = x’^-[I,A], (4) 

where denotes the message vector with length N and denotes the encoding output with 
length 2N. This means that the code bit calculated by a node v is either its self-information bit 
Xv or the parity of the self-information bits in its in-neighborhood Af~. Therefore, QC-3 codes 
are easy to encode with local communications and admit distributed implementations. However, 
the decoding can be quite difficult depending on the graph structure. We only use QC-3 codes 
for the extended Erdos-Renyi networks with BECs in Section IVIII 

A QC-2 graph code is also a rate-| code with a generator matrix G = [I, A]. However, A is 
the adjacency matrix of a subgraph Q = {V, 8 ) of Q = {V, 8 ), where 8 C 8 . Alternatively, a 


*It can be shown that this assumption can be relaxed by assuming that the graph is an Erdos-Renyi graph with the minimum 
number of additional links to ensure bounded multi-hop distance dmax from each non-sink node to the sink, That is, the edge set 
£2 can be viewed as the minimum set of edges that is further added to the graph with edge set £i so that each non-sink node 
has a directed path of length smaller or equal to dmax to the sink. This relaxation can be made because it does not affect the 
scaling hounds on the number of transmissions in the achievable scheme for the extended Erdos-Renyi Graph in order sense. 
More specifically, one bit can always be transmitted using a directed path to the sink node with 0 ( 1 ) transmissions to obtain 
an error probability e, which means that by increasing the number of transmissions by a constant multiple, the directed path of 
length dmax between a non-sink node and the sink node can he viewed as a directed link with error probability e. 
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QC-2 graph code may be viewed as a generalization of the ^C-3 code when some edges in the 
original edge set E are removed. This eode is mueh more flexible than the ^C-3 eode and we 
will use it for geometrie graphs in Seetion |Vll 

A QC-\ graph eode has no direet relationship with the adjaceney matrix but the idea is similar 
to the previous eodes. It assumes that each code bit ealeulated at a node v is the parity of a 
subset of nodes that are within 0(logiV) hops of v. This eode is quite general and we will show 
that, for arbitrary graph topologies, a QC-l graph eode can help achieve an upper bound on the 
eommunieation eomplexity of data gathering which is at most a 0(logiV) multiple of the lower 
bound. 

The eode length of the three graph eodes are all in the order of 0(iV). Sinee the eode length 
is in the same order as the number of nodes in the network, and only one-shot eomputing of 
distributed eneoding is required for the one-shot data gathering problem, the average number 
of bits ealeulated by each node during the distributed eneoding proeess is a eonstant (details 
follow in the subsequent seetions). Furthermore, all three types of graph eodes are designed to 
possess a sparseness property: the number of ones in the generator matrix will be in the order 
of 0(A^logiV), beeause the studied graphs (either geometric graphs or extended Erdos-Renyi 
graphs have a sparse strueture). This means that eneoding eaeh bit requires only 0(logA^) self¬ 
information bits. Therefore, effieient distributed eneoding with a small number of broadeasts 
beeomes possible. 


V. QC-l Graph Codes in a General Graph 

In this seetion, we eonsider general eonneeted network topologies. We first eonsider this 
problem on BSCs satisfying the assumption (A. la), then we extend the results to BECs satisfying 
the assumption (A. lb). We design a general distributed in-network computing algorithm called 
the QC-l graph code. Reeall that in the case of complete networks, as studied in [HI, |l5l, a lower 
bound on the eommunieation eomplexity for data gathering is 0(A^loglog A^). In what follows, 
we provide a lower bound for general networks. Then, we use the QC-l graph eode to get an 
upper bound which, we show, is close to the lower bound when the graph diameter is small, and 
meets the lower bound when the diameter is large. We also give an intuitive example on why 
this upper bound can be achieved and why there is a small gap between the lower and upper 
bounds. 
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A. Communication Complexity Lower Bound in a General Graph 

As shown in Fig.[Tl construct the breadth-first-search spanning tree of the network Q = (V, £), 

Ld 

and then, eonstruet the layered partition V = IJ V; of the network based on the multi-hop distanee 

1=1 

d{vn,vo) from each node Vn to the sink vo, as defined in Section ITTT-RI Note that the distanee 
d{vn,vo) in the tree is the same as in the original network. 

By definition of the BFS spanning tree and the associated layering, we know that in the graph 
Q, no edges exist between non-suceessive layers, but edges eonneeting nodes in the same layer 
may exist. By /-th eut, we denote the set of edges from the /-th layer V; to the (/ — l)-th layer 
Vi-i. We know that information ean only be transmitted hop-by-hop from the bottom layer 
to the sink. Therefore, on eaeh out between two layers of the multi-layer BFS spanning tree, 
there is a eertain amount of information that needs to be transmitted. The overall number of 
broadoasts ean be lower bounded by the sum of information necessary to be transmitted on all 
of these disjoint outs. This gives the basio lower bound for data gathering, i.e., transmitting the 
vector X = (xi, X 2 , ...^xnY to vq. 

Theorem 1. Suppose the oommunioation links in the graph Q satisfy the assumption (A.la). 
Then, if all data are gathered at the sink xq with error probability by a feasible broadcasting 
scheme y, the oommunioation eomplexity is necessarily bounded below by 


> c4gN, 


(5) 


where = i-ule) ^ constant, N denotes the number of nodes in the graph and dg is the 
average distanee to the sink, defined as 


dg 


N 


— J2YVn,Vo). 


n=l 


( 6 ) 


Proof: Denote by zi the whole data reoeived in the sink vq during the entire data gathering 
process, i.e., when the broadoasting seheme in plaee terminates. Then, for eaeh bit x„, we know 
that Xn —)■ zi ^ Xn is a Markov ohain, where Xn is the estimate of Xn at vq. Therefore, based 
on the data processing inequality and Fano’s inequality [|4^ . it holds that 


//(X„|zi) <H{Xn\Xn) < i/(Pbit) + Pbitlogdxl " 1) = i^(Pbit), (7) 

where Pbit is the bit error probability Pr(x„ 7 ^ Xn) of estimating Xn from zi and the seoond 
equality follows from the fact that |x| = IF 2 I = 2. Since Xi, X 2 ,..., xat are assumed to be 
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First Layer First Cut 



Fig. 1. A grid network layered by the distance to the sink. The spanning tree rooted at the sink is represented by a solid line. 


independent of eaeh other, we know that H (x) = N. Therefore, we have 

(a) ^ 

/(x; zi) = i7(x) - if(x|zi) >N-J2 H{xn\zi) > iV[l - H{Pu,)], (8) 

71=1 

where (a) holds because 

N N 

H{x\zi) = '^H{Xn\zi,Xi, . . .,Xn-l) < '^H{Xn\zi). 

71=1 71=1 

Since vq has no side information about x to start with, the amount of (mutual) information 
/(x; zi) needs to be broadcasted via the BSCs in the first cut. Assume the number of broadcasts 
in the first cut is '^i. Then, since the number of channel uses is and each channel use has 


capacity 1 — if(e), from the cut-set bound that /(x; zi) < ^i(l — H{e)) we must have 

/(x;zi) Ar(l-if(Pbit)) . 

^ - 1 - H{e) - 1 - H{e) “ ^ ^ 

where step (a) follows from the fact that the bit error probability Pbit is always smaller than total 
error probability Pe^\ 

For each layer I, denote by Si the set of nodes in the union Vi IJ Vz+i U ''' U Denote all 
self-information bits in Si by xsi- Define A) = Similarly, we obtain 




( 10 ) 


1 - i/(e) - 1 - i/(e) 

where zi is the data transmitted through the /-th cut and step (b) is obtained by combining (|7]) 


with the data processing inequality 


I{,Xn]Zi) > /(a;n;zi),Vn e Sp 


( 11 ) 
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To show that (fTTI) holds, we reason as follows: from the definition of layers and cuts, if Xn is 
below the V/, the information of Xn has to be routed through the Z-th cut before it gets to the 
first cut, otherwise, Xn must be in upper layers of the Z-th cut, a contradiction. More formally, 
for Xn € Su we note that zi is conditionally independent of Xn given z^, and hence (fTTI) holds. 
Summing over all Z, we obtain the following lower bound on the (total) number of broadcasts: 

Ld Ld 

( 12 ) 

1=1 1=1 

where step (c) follows from the definition of dg and the exchange of summation. Thus, we 
obtain (fS]). This lower bound holds for any broadcast scheme so (IT^ is a lower bound on the 
communication complexity ■ 

B. In-network Computing Algorithm 

In this part we provide the QC-l in-network computing algorithm for gathering all data at vq in 
an arbitrary network. Before we provide the algorithm, we provide some preparatory procedures 
as follows. First, we construct the BFS spanning tree T = (V,£^r) rooted at the sink no. as 
defined in Section ITTT-Bl That is, in the layered network shown in Fig. fH we delete all edges in 
the same layer but reserve edges that span adjacent layers. The resulting network is like Fig. [2] 
and the edge set is denoted by Sq-. As defined in Section ITTT-Bl denote all descendants of the 
node V hy V.U. Define 

Br = {v eV ■.\Dy\ <'y log N}, (13) 

where 7 is a constant. Define Aj- = V \ Bq-. It is obvious that each path from a leaf-node Vn 
to the root vq is constituted by a series of nodes in Br, followed by another series of nodes in 
At (as shown in Fig. O. 

Then, we propose the QC-l algorithm, as shown in Algorithmfll The basic idea is: Each G V 
gathers all self-information bits from its descendants in V^. Then, it sends all the information in 
D^U{f}, including bits from its descendants and its own self-information bit, to its parent-node. 
In order to make this scheme a feasible in-network computing scheme, each node v has to start 
transmitting after all of its children nodes complete transmitting. 

All nodes use linear block codes to encode the information that it needs to transmit. Nodes 
with small descendant size {\Vn\ < ylogN) has to insert zeros (dummy bits) to the message 
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Fig. 2. The in-network computing algorithm carried out on the spanning tree. 


vector before encoding. The performance guarantee of this algorithm is shown in Theorem [2l 
The intuition underlying why the error probability is small is put in Remark |2l 

Theorem 2. Suppose the communication links in the graph Q satisfy the assumption (A.la). 
Then, for each tuple of constants {R, 7 ) satisfying 

R <'yEr{e,R), (14) 

where Er{e, R) is the random coding error exponent from dH), the number of broadcasts that the 
scheme y provided in Algorithm [U incurs is upper bounded by 

<N{^ + l) + N{-f\ogN/R + l) = max{e{dgN),eiN log N)}, (15) 

where N denotes the number of nodes in the graph and dg is the average distance to the sink, 
which is defined in db]). Moreover, as N ^ 00 , the error probability decreases polynomially 
as 

p{N) < ^ exp[-P,(e, R)/R]) , (16) 

and, in particular, achieves limAr^oo= 0. 

Proof: In what follows, we show how to obtain the upper bound on the number of broadcasts 
in dB]), while the error probability analysis of dMl) is put in the Appendix Each node v G B-p 
(including leaf-nodes) transmits a codeword of size \'jlog N/R], so the number of broadcasts 
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Algorithm 1 ^C-1 algorithm 

Initialization; Construct the BPS spanning tree T = (V,£^r) rooted at the sink vq. 

Step 1: Eaeh leaf-node v eneodes the binary vector (x^,0, ...,0) with length ylogA^ using 
random eoding with rate R and transmits the eodeword to its parent-node. 

Step 2: Eaeh non-leaf node v, from its ehildren-nodes, reeeives the self-information bits of 
its entire set of deseendants Vy. After all of its ehildren-nodes finish transmitting, the node v 
relays the self-information bits of all of its descendants and its own self-information bit to its 
parent-node, using error eontrol eodes. Depending on if v is in Br or Ar, the eoding sehemes 
differ. The eoding details are shown below. 

• Actions in Br- Eaeh v G Br deeodes the self-information bits from and form a binary 
veetor with length Vy + l with its own self-information bit. Then the node v inserts 7 log N— 
1 — ID^I zeros to the veetor to make the length ylogA^ and uses random eoding to encode 
this veetor. Einally, it sends the whole \{\Vv\ + 1)/-^! bits to its parent-node, where R is 
the eoding rate. 

• Actions in At- Each v G At decodes the self-information bits from V^, and uses 
random coding to encode these bits and its own self-information. Einally, it sends the 
whole \{\'D^\ + 1)/R] bits to its parent-node, where R is the coding rate. 


at each node v e Bt satisfies 

<7logiV//2+l. 

The number of broadcasts at each node v G At is 

K = \iVv + l)/R] < {Vv + l)/R + l. 


(17) 


(18) 


Therefore, the final number of broadcasts is 

'^v + '^v < + 1 )/7? -i- 1 ] -i- ^^(7 log 7? -1- 1 ) 

vGAt v&Bj- 'uSV vGV (19) 

=7V(^ + 1 ) + iV (7 \og N/R + 1). 

H 

In Appendix lAl the remaining part of the theorem, i.e., Eq. (fT^ . is proved in detail. ■ 

Remark 2. The nodes in Bt all have a descendent size \V^\ < 7 logiV, and hence they do not 
have enough data to use powerful error control codes with large code length, unless dummy 
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bits are inserted. The eode length 7 log N is to ensure that, the probability that all transmissions 
in Bj- are reliable, deeays polynomially with N under the union bound. The nodes in Ar all 
have large deseendent size, so they ean use powerful error eontrol eodes to earry out bloek 
transmissions with low error probability. 

C. Comparison between the Upper Bound and the Lower Bound 

Clearly, when the average distanee dg to the sink is large and grows polynomially with N, 
the first term in the RHS of (fT5l) dominates. Thus, the upper bound is the same order as the 
lower bound in Theorem [T] when the average multi-hop distanee dg is large. In this seetion, we 
make a summary of results both in this paper and [15]| and diseuss the tightness of the obtained 
sealing results in different eases. 

Corollary 1. Suppose the eommunieation links in the graph Q satisfy the assumption (A.la). 
Then, the eommunieation eomplexity of data gathering has an upper bound and an 

lower bound satisfying 


<^(N) ^ max{0((igA^), 0(A^log A^)}, 

= max{Q{dgN), 0(A^loglog A^)}. 


( 20 ) 


( 21 ) 


Proof: Considering dS]) and (fTSl) . to prove (l2TI) . it suffiees to show = f2(A^loglog A^). 
In faet, it is stated in Theorem 1 in [5] that if the number of noisy broadeasts is 

_ I3{n)N, 


the error probability Pe^'^ that the reeeiver does not output all self-information bits satisfies 



( 22 ) 


Then, we have 



DRAFT 


Wednesday February, 2017 








23 


Dividing both the LHS and the RHS with 4 log (i), we have 

log 13 log log N log(l - - J^) - log(48 log(i)) 

B H->— - — -1- - - 

2 log i 4 log i 4 log i (23) 

=f2(log log A^). 

From (l23l) . we immediately have I3{N) = f2(loglog A^). ■ 

The lower bound is tight in the order sense in many eases. An example to support this 
elaim in the low-diameter regime is the ©(A^loglog A^) eommunieation eomplexity upper bound 
obtained in eomplete graphs in |[ll . An example in the high-diameter regime is the grid network 
studied in [[2l|. Theorem IV. 1 and Theorem IV.2 in [|2l| prove that in a ^/n x y/n grid broadeast 
network with a transmission radius r, the eommunieation eomplexity for data gathering is 
max{0(A^^/^/r), ©(A^loglog A^)}, whieh matehes the lower bound in this seetion, if the faet 
that the typieal diameter for this network is /r is eonsidered. 

However, the upper bound obtained by the QC-l algorithm might not be tight in all oeeasions. 
For example, it is apparently loose when dg <2. To show this elaim, plug m dg <2 into (l20l) 
and (I2TI) . Then, we know that = ©(A^logA^), = ©(A^loglog A^). This mismateh 

is beeause the QC-\ algorithm is designed for general graph topologies and is not adaptive 
in speeifie graph topologies. However, the QC-\ algorithm ean be improved in speeifie graph 
topologies to meet the lower bound. We use the following example to show a basie topology 
strueture that helps aehieve the lower bound, whieh motivates the geometrie graph in the next 
seetion. 

Example 1. Consider the examples shown in Fig. [3l We abuse the terminologies, and use ‘heavy- 
tail’ to deseribe the ease of (a), and use ‘light-tail’ to deseribe the ease of (b). Suppose in the 
heavy-tail star network (a), there are 7 log N nodes that form a elique (eomplete graph) on the 
end of eaeh tail, where 7 satisfies the eondition (fT4l) in Theorem |2l All nodes in these eliques 
form the set Bj- defined in (flSl) . Then, we modify Algorithm [T] by letting eaeh v G Bj- broadeast 
jt = r ^iogp/ 4 e^(i^r)t^ ~l other nodes in the elique that v lies in, where Pch < 1/2 is a 

eonstant. This modifieation ehanges the number of broadeasts at eaeh node v G Bj- from (fTTI) 
to = it = ©(log log A^). Therefore, the total number of broadeasts is ehanged from (fT9l) to 

<j/Sr| + [{V, + l)/R+l] = max{©(iVloglogiV),©(4iV)}, (24) 

vGAj- 
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Fig. 3. (a) A heavy-tail star network that achieves the lower bound ED; (b) A light-tail star network that cannot achieve the 

lower bound. 


which achieves the lower bound (l2TI) . It ean be shown that the overall probability of getting an 
error in the broadeasts in all eliques deeays polynomially with N (see Seetion IVI-BI) . 

However, for the light-tail network shown in Fig. [3] (b), there is no eonvenient strueture to 
be utilized for a broadeast. When the length of eaeh tail is greater than 7 log N, we ean use 
error eontrol eoding for the nodes in Ar, but the nodes in Bj- (nodes that are elose to the tail 
ends) ean only insert dummy bits to obtain large eode length in order to ensure reliability. This 
issue limits the number of broadeasts to seale as = max{0(A^log A^), Q{dgN)}. Further, 
when the length of eaeh tail is smaller than 7 log N, we ean only use error eontrol eoding with 
dummy bits and length 0(logiV) at all nodes, sinee all nodes are in S7-. This limits the number 
of broadeasts to seale as = 0(A^log A^). Therefore, the total number of broadeasts has the 
same form as (l20l) . and does not reaeh the lower bound. 

Remark 3. The heavy-tail strueture in Fig. [3] (a) is the basie strueture that aehieves the Q{dgN) 
upper bound by using error eontrol eoding. This is essentially the strueture eonsidered in |l2l 
Theorem IV.2]. In the following seetion, we eonsider a general geometrie graph, whieh is 
essentially a generalization of this heavy-tail network strueture. 

D. Extension to Binary Erasure Channels 

The eonelusion of the previous seetion ean be easily generalized to BECs. 

Corollary 2. Suppose the eommunieation links in the graph Q satisfy the assumption (A. lb). 
Suppose the parameters of Algorithm [T] is the same as in Theorem [2l exeept that all the error 
probability e is ehanged into e/2. Then, using Algorithmfil we ean aehieve polynomially deeaying 
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error probability. The number of broadeasts ^ has an upper bound that is the same with (fT5l) . 

Proof: Note that when a bit is erased by the BEC, we ean always flip a fair eoin and assign 
a random binary value to this bit on the reeeiver side. The equivalent ehannel of eombining a 
BEC with erasure probability e and a fair eoin flip is a BSC with erossover probability e/2. Thus, 
all eonelusions of Theorem [21 holds, after the erossover probability of all BSCs are ehanged to 
e/ 2 . ■ 


Corollary 3. Suppose the eommunieation links in the graph Q satisfy the assumption (A. lb). 
Then, if all data are gathered at the sink uq with error probability by a feasible broadeasting 
seheme the eommunieation eomplexity is neeessarily bounded below by 


> cJgN, 


(25) 


where ^—- is a constant, N denotes the number of nodes in the graph and dg is the 

average distanee to the sink, defined as 


dg 


N 


— ^d{Vn,Vo). 


n=l 


(26) 


Proof: The proof is almost exaetly the same as the one of Theorem [TJ The only differenee 
is that the ehannel eapaeity of eaeh BEC link is 1 — e instead of 1 — if (e). ■ 

The upper bound that we obtained in Corollary [2] is iaa.x{Q{dgN), Q{N log N)}. Therefore, 
the two bounds meet with eaeh other when dg = VL{logN), whieh holds in many networks, sueh 
as a square grid network of size x y/N. 

In faet, one ean show that in some other types of graphs, the lower bound il{N log N) is also 
valid for BEC models. We will show in the following that for a network with eonstant degree, 
il{N log N) is a valid lower bound on the number of broadeasts for the noisy broadeast problem. 
Therefore, for this partieular type of networks, max{0((JgA^), 0(A^log A^)} is both the upper 
and lower bound on the number of broadeasts. 


Lemma 3. Suppose eaeh node u G V in the graph Q = (y,S) satisfies deg(u) < D and P is a 
eonstant. Then, for any seheme y' to obtain an output x with eonstant error probability Pr(x 7 ^ 
x) < (5 in the noisy broadeast problem, the number of broadeasts satisfies = Q{NlogN). 
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Proof: For an arbitrary node n G V, suppose the number of broadeasts by v is Then, 
sinee the degree of v satisfies deg(f) < D, the probability that all broadeasts made by v are 
erased is lower bounded by 

(27) 


where recall that e is the erasure probability. If all broadcasts from one particular node are erased, 
the sink can never recover the entire input bits x. Therefore, 


PW = Pr(x ^ x) > 1 - J](l -p,) > 1 - J](l - e 




(28) 


v£V 


This implies that 


1 - 5 < 1 - Pi^^ < JJ(1 - e^"^”) < 


vev 


N 


Ed 


vev 




where (a) is from 1 — x < exp(—x). Rearranging the terms in the above inequality gives 

N logiV — loglog(l/(l — 5)) 


v£V 


N 


N 


E- 

vev 




N 


(29) 


E*i> 


vev 


D 


log(l/e) 


= n{N\ogN). 


(30) 


VI. QC-2 Graph Codes in a Geometric Graph 

In the previous section, we considered the communication complexity problem in a general 
graph where the upper and lower bound has a 0(logiV) gap. In the following sections, we con¬ 
sider graphs where the communication complexity lower bound max{Q{dgN), ©(iVloglogiV)} 
can indeed be achieved. The in-network computing algorithm utilized in this section is the QC- 
2 graph code. Our QC-2 graph code in this section is based on cell partitioning in geometric 
graphs and node replication. In particular, we partition all nodes in the network into cells based 
on geographic location. If we can partition nodes into groups of 0(log N) and each group forms a 
local complete graph, we can then use similar ideas from jSll to aggregate data, i.e., we can use a 
short code of length 0(logiV) to aggregate data reliably in a local complete graph. However, for 
general geometric graphs, this partitioning does not apply directly. Thus, we introduce “dummy 
nodes”, so that the number of nodes in each cell always exceeds 0(logA^). For the geometric 
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Fig. 4. A geometric graph where each node can only broadcast within a certain distance. The node wi can broadcast to V 2 
and V3 but cannot broadcast to 114,. Cells 1 and 4 are grouped into dense set Sd while cells 2 and 3 are grouped into sparse 
set Ss. Dashed circles around a solid node represents the replications of this node, i.e., dummy nodes. 


graph with connection distance r > y (see Assumption (A.3a)) that we consider, the 

introduction of dummy nodes does not change the number of broadcasts in order sense. 

We first consider geometric graphs Q = (V,£^) that satisfy the connectivity assumption 
(A.2), the topology assumption (A.3a) and the channel assumption (A.la). Extensions to random 
geometric graphs with assumption (A.3b) and BECs with (A. lb) are given in Section IVI-Cl As 
shown in EiglU we use a square tessellation scheme to partition the 1-by-l area into B'^ = 
small squares, such that each square has length Vg < We call each small square a cell. By the 
assumption of a geometric graph, each node is connected to all other nodes in the same cell since 
the diagonal of each cell is smaller than r. Therefore, for any two nodes v and v' in the same cell, 
the difference between the multi-hop distances to the sink vq satisfies \d{v,vo) — d{v',vo)\ < 1. 
According to the topology assumption (A.3a), -, and hence 


< 




(31) 


Denote by Vi the nodes that belong to a particular cell indexed by 1. Then, the node set V is 
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divided into subsets, whieh is written as 

V = lJVz. (32) 

i=i 

We use S = {1,2,... ,-B^} to denote the eell index set. In eaeh eell, eall the node with the 
minimum distanee to the sink no the cell head. Separate S into two parts S = Sg, where 

5, = {/g 5||V;| >plogiV}, (33) 

and Ss = S\Sd, where p is a eonstant. It is elear that Sd denotes the eells where the nodes are 
dense and Sg denotes the opposite (see Fig. H]). 

For eaeh eell I in the set of dense eells Sd, partition Vi into groups, sueh that the number of 
nodes in eaeh group ranges between p log N and 2p log N. Eaeh group is loeated in the same eell, 
and hence forms a local complete graph. For each cell I in the set of sparse cells Sd, if V; 7 ^ 0, 
replicate each node, together with the held self-information bit, for times. Therefore, in 

each cell, there are |V;| ^ [plog A^, 2plogiV] replicated dummy nodes (see Fig. H]). Each 

dummy node holds one dummy self-information bit that replicates the original self-information 
bit. We assume that all the dummy nodes in a cell I form a single group. In the following 
sections, we provide the algorithm for the graph with dummy nodes. But it should always be 
clear that all actions taken by a dummy node is actually implemented by the original physical 
node. By introducing dummy nodes, all nodes in V are partitioned into groups of 0(plogA^) 
nodes, and all nodes in one group form a local complete graph. The reason to partition V into 
approximately even groups of size 0(log A^) is to ensure the polynomial decay of the final error 
probability with N, which will be explained soon in detail. Suppose the total number of groups 
is D. Denote by v_a the cell head of the corresponding cell that the group A is located in. 

A. In-network Computing Algorithm 

The algorithm for data gathering in a geometric graph has two steps. In the first step, in each 
cell, all self-information bits are gathered in the cell head, using a QC-2 code. In the second step, 
a backbone network constituted by cell heads is constructed, in order to route all information to 
the sink vq. 

We first design the generator matrix of the QC-2 graph code. As defined in Section |IVl a 
QC-2 graph code is a binary linear block code with a generator matrix G = [I, A], where A is 
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the adjacency matrix of a subgraph of the original graph. The structure of this generator matrix 
ensures that each code bit can be calculated by local information exchanges. Here we design 
the generator matrix to be G = [I, A], such that A is a block diagonal matrix written as 

A = Diagjii, is, • • • (34) 

where D is the number of groups, and the m-th block Am is a square matrix that has the same 
size as the m-th group. This definition is always valid, because each group forms a local complete 
graph, and since a sub-graph of a complete graph can have arbitrary topology, we know that 
each block Am can be arbitrary, as long as it is symmetric and its size is the same as the size 
of the m-th group. In the algorithm to be shown, we require each group to distributedly encode 
all of its self-information bits with a generator matrix Gm = [I■, Am], which means that we are 
decoupling the encoding with matrix G into local computations in each group (and equivalently, 
in each cell) based on the block diagonal structure of A. This distributed encoding can clearly 
be done with local information exchange. It should be noted that the matrix A is not actually 
the adjacency matrix of a subgraph of the original graph Q, but of the augmented graph with 
replicated dummy nodes in sparse cells. 

Apart from to be block diagonal, we further require each block Am of A to satisfy the 
property that a systematic code with the generator matrix Gm = [I, Am] achieves the random 
coding exponent in Lemma [2l which ensures that each local encoding process yields a codeword 
with powerful error correcting capabilities. The formal description of the local encoding scheme, 
or the local gathering scheme, is given in the ‘Local Computing’ part of Algorithm [2l In each 
group Am, as mentioned above, a rate-4 graph code with the generator matrix Gm = [I, Am] 
and code length 2|Am| > 2plogA is utilized to distributedly encode all data in this group and 
ensure reliable decoding in the cell head. In particular, denote by Xm the vector that contains all 
self-information bits in group Am- Then, the encoding yields x]]]Gm = [x^,x]]]Am]- Thus, each 
code bit can be calculated using local broadcasts, since each code bit is either a self-information 
bit, or the parity of some self-information bits in a local complete graph. 

It should be noted that in a sparse cell, although the actual actions are taken by the physical 
nodes, these actions can be viewed as being performed by the dummy nodes without changing 
the statistical properties of the distributed encoding scheme. To be specific, let each physical 
node broadcast its self-information bit for jg ■ times (here B is the only group in the 
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Fig. 5. A geometric graph routing backbone. Both sparse and dense cells introduce dummy bits to facilitate routing. 


sparse cell and is the same as Vi in (l3^ 1. which is the same with letting each dummy node 
broadcast jg times. Then, each dummy node receives all other bits, and computes one code 
bit signified by the local graph code generator matrix Gm- Finally, all these code bits and all 
self-information bits (all the dummy bits) are transmitted to the cell head to be decoded. The 
broadcast channel between two replications (dummy nodes) of the same physical node is actually 
a perfect channel, which only incurs less errors. 

In the routing step, all self-information bits are routed along a backbone network. The algo¬ 
rithm is given in the ‘Backbone Routing’ part in Algorithm [2l As explained in the algorithm, if 
two cell heads Vi and Vj may interact with a path Uj —)■ u' —)■ u' —)■ Vj, they are defined to be 
connected in the backbone network. Since the underlying network Q = (V,£^) is connected, the 
backbone network is also connected. Each node in the spanning tree T of the backbone network 
is a cell head and has the task of forwarding all the self-information bits in the corresponding 
cell. Furthermore, each node in the backbone network has to relay all information bits from its 
children-nodes (direct descendants) in T as well. This decode-and-forward routing is carried on 
along the entire spanning tree T of the backbone network, until the sink vq receives all the data, 
i.e., all the self information bits in the whole network. The number of dummy bits introduced in 
each cell head v is smaller than plog A^. As shown in the proof, the reason to append 0{plogN) 
dummy bits in each node v E T at the end of the ‘Backbone Routing’ part of Algorithm [21 is 
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to ensure the polynomial deeay of error probability with N. Moreover, appending 0{p\ogN) 
dummy bits in eaeh eell does not ehange the number of broadeasts in the order sense. 


B. Upper Bounds on the Error Probability and the Number of Broadcasts 

In the following lemma, we analyze the error probability and the number of broadeasts in the 
loeal eomputing step. 

Lemma 4. Assume Q = (V, 8) satisfies the topology assumption (A.3a) and the ehannel assump¬ 
tion (A.la). Further assume that 

4pF;^(e+ pch, ^) > 1, (37) 

where e is the ehannel erossover probability, pch < 1/2 is a eonstant defined in (1351) . p is the 
eonstant defined in (l3^ and Erf) is the random eoding exponent for BSCs. Then, using the 
loeal eomputing step in Algorithm [2l i.e., the QC-2 eode, and using ©(A^loglog A^) number 
of broadeasts, all eell heads learn all the self-information bits in their own eells with high 
aeouraey, that is, the total error probability in the loeal eomputing step, Pe,local = Pr(3n e 
T,v has a wrong deeoding output), eventually deeays polynomially with N. 

Proof: Aeeording to Lemma [H after transmitting eaeh bit x for jg times (defined in (l35]) f. 
the bit X is erroneous with error probability 


Pe < [4e(l 


e )]2 < 


Pch 

2p\ogN 


(38) 


Sinee eaeh eode bit ealeulated at a node Vm,,i is the XOR of at most 2plogA^ self-information 
bit (defined in (l36l)). by the union bound, eaeh eode bit of the QC-2 eode is eneoded ineorreetly 
with probability 

Pe,v = Pr(Xm,i f Xmp) < ' 2p log N = Pch- (39) 

2p log N 

By Lemma [2l this makes the error probability of reeovering all self-information bits in group 
A at the eell head v_a be upper bounded by 


^elocai < exp[-4plogiVP,(e + p,h, ^)] = 


(40) 
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Algorithm 2 Identity-Function Computation in a Geometric Graph 

• Initialization: For all cell I G Sd, partition Vi into groups where each group has plogiV to 

2plogiV nodes. For all cell I G Sg, replicate each node times and form one group 

in this cell. 

• Local Computing: For each group Am, first let each node in Am broadcast its self¬ 
information bit for jg times where 


. _ 21og(2plogiV/pch)n 
' logll/4£(l - £)] '■ 


(35) 


and Pch < 1/2 is a constant. 

Secondly, each node computes one code bit using the corresponding column in Am, the 
m-th sub-matrix of the generator matrix G. For example, the i-th node Vm,i in the m- th 
group calculates 


Vn 


= X 




(36) 


where §.^,1 is the i-th column of Am, and j is the majority-rule-based estimate of Xm,i 
at Vm,i based on information received by Vm,i from the first step of local computing, where 
Xm,i is the self-information bits of the nodes in group Am- The summation is in the sense 
of modulo-2. Then, all these code bits are transmitted to the node v_a, the cell head. 
Thirdly, each node in the group transmits its own self-information to n_ 4 . 

Finally, performs decoding on the received bits to recover all self-information bits in 
group A. 

• Backbone Routing: Construct the backbone network constituted by all cell heads in the 
following way: two cell heads Vi and Vj in cell i and cell j are connected if there exist one 
node u' in cell i and v' in cell j such that the pair (n', v') is in the original edge set £. 
Construct the breadth-first-search spanning tree T of the backbone network rooted at the 
sink vq. Each node relays all information bits from itself and its descendants in T to its 
parent-node. 

Suppose a node v e T needs to route a binary vector x. Then, v partitions x into blocks 
with length plogiV. If the length of x is not a multiple of plogiV, some dummy bits are 
appended into x. After that, v encodes each block with a rate-i? block code and transmits 
the codeword to the parent-node. 
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The error probability upper bound is the same for sparse and dense eells. Therefore, the total 
error probability is 


P. 


e,local 



pA 

^ e,local 


is in a dense cell 



pB 
e, local 


B is in a sparse cell 


jy- (e+Pch, I) 

(e+Pch, \) 


E 


1 + 


E 


, is in a dense cell 


B is in a sparse cell 


N f I 2N 

plogN \^y CglogiV 


(41) 


whieh eventually deeays polynomially with N when ApEr{e + Pch, > 1- Note that step (a) 
follows from (l40l) and step (b) follows from the faet that the number of groups in dense cells 
is upper bounded by and the number of groups in sparse cells is upper bounded by the 

total number of cells m (l3TI) . 

The number of broadcasts consumed by group ^ in a dense cell is 

= & + (42) 


where |^| denotes the number of nodes in group A, and the constant 2 is because each node, 
apart from broadcasting its own self-information bit for jg times, has to transmit a code bit 
and its own self-information bit to the cell head. Similarly, consider the fact that each node is 
replicated into dummy nodes, we know that the number of broadcasts consumed by 

group in a sparse cell is 

<., = 0, + 2)-|B|r^^l. (43) 

Since 

leir^^l < iei(^^ +1) = lfil + plogiV < 2plogiV, (44) 

we have 

'^ocai < {jg + 2) ■ 2p log N. (45) 


Thus, the total number of broadcasts is 

+E<..<&+2)(Ew + E 2p logiV) 


A 


A 


<{jg + 2) 


N -I- 2plog 


2N 


Cg log N 


+ 1 


(46) 


= 0(ArioglogAr), 


Wednesday P* Febraary, 2017 


DRAFT 
















34 


where we have used jg = ©(loglogiV) in (1^ and (|3TI) . ■ 

Lemma Instates that ©(loglogiV) broadeasts suffice to make all cell heads successfully gather 
all local information. After that, cell heads form a backbone network and all local information 
is routed to the sink vq. The analysis of the whole Algorithm [21 is given in the following. 

Theorem 3. Suppose Q = (V,£^) satisfies the topology assumption (A.3a) and the channel 
assumption (A.la). Suppose the parameters of Algorithm [2| satisfy (1371) and 

^Er{e,R)>3/2, (47) 

where the parameters p and e are defined the same as in Lemma jH R is the code rate of backbone 
routing, and Er{-) is the random coding exponent for BSCs, as defined in ([21). Then, using the 
in-network computing scheme defined in Algorithm [2l in which the number of broadcasts scales 
as max{©(dgA^), ©(A^loglog A^)}, the final error probability eventually decays polynomially 
with N. 


Proof: We respectively analyze the total error probability and the total number of broadcasts. 
As defined in the backbone routing step of Algorithm |2l each transmission relays a block of 
plogA^ bits with a rate-i? code. Thus, the error probability of each block on one transmission 
is bounded by 


e, routing 


< ex 




(48) 


In all, the number of appended dummy bits is at most plogA^ • (at most plogA^ dummy 


bits in each cell and cells), and hence the number of blocks is at most 


N+p\ogN-B^ 

plogN 


. Each 


block is transmitted along at most 3 • 2B hops, where the multiple 3 is because each path 
Vi ^ v[ ^ Vj —)■ Vj between two cell heads is constituted by at most three hops in the 
underlying graph Q, and the multiple 2B is the longest multi-hop distance to the sink on a 
B X B grid. Using the union bound, the error probability that the sink vq gets a wrong version 
of all information bits is bounded from above by 

ft,™,., < 6B ■ ^ . (-TU + b2)a,-*e.(.,h). (49) 

plogA plogA 

Using the fact that B = 0{N^), we know that the total routing error probability decays 
polynomially if j^Er{e,R) > 3/2. 

As for the number of broadcasts in the backbone routing phase, each bit from a cell head v 
is now routed along a path on the backbone network, the length of which is at most 3 times 
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the original distance d{v, Vq). All appended dummy bits are only transmitted for one hop on the 
backbone network, or equivalently, at most three hops on the original network. Therefore, the 
total number of broadcasts for routing is 

^routing <MgN + 3p\ogN ■ < 3dgN + 3plogiV(J + l)^ = e{dgN). (50) 

y Cg log N 

Combining (l4T])(l4^(l49l)(l50l) . we know that the overall error probability decays polynomially 
with N and the number of broadcasts scales as max \^Q{dgN), ©(A^loglog A^)}. Therefore, 
the proof is completed. ■ 

Remark 4. The proposed QC-2 code can be viewed as an extension of the coding scheme in 
E Section 7] in complete graphs to arbitrary connected geometric graphs. In a complete graph, 
we can partition all nodes into non-overlapping cells of size ©(log A^). Then, nodes in each cell 
form a complete graph of size ©(logA^), which means that all nodes in the graph are in dense 
cells. In that case, we do not need to define dummy nodes. We do not need to construct the 
backbone network either, because all nodes in the network have direct links to the sink node. 
Therefore, the number of broadcasts in a complete network is ©(A^ log log A^). 


C. Extension to Random Geometric Graphs and BECs 

A counterpart of Theorem [3] in random geometric graphs is the following corollary. This result 
generalizes the Theorem 2 in |l6l to cases when the connectivity range is larger than 
Note that in random graphs, we only care about the expected error probability = Eg[P®], 
which has been discussed at the end of Section [3] 

Corollary 4. ( Theorem 2])Suppose Q = {V,E) satisfies the topology assumption (A.3b) 
and the channel assumption (A.la). Suppose the parameters of Algorithm [2] satisfy the same 
conditions as in Theorem [3l Further assume that Cg > K Then, using the in-network computing 
scheme in Algorithm [2l we can obtain the identify function at the sink with high probability, 
and the number of broadcasts scales as max{©((igA^), ©(A^loglog A^)}. That is, the expected 
error probability Pi^^ = Eg [Pf ] goes down polynomially with N. 

Proof: One possible way to prove this corollary is to use the same idea in |13, which 
relies on the result that in a random geometric graph satisfying the assumption (A.3b), after 
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the tessellation step as shown in Fig. S each cell has 0(logiV) nodes with high probability. 
However, we present a different proof. 


pW = Eg[Pf] <Pr(^ is connected)Eg[Pf 1^ is connected] + [1 — Pr(^ is connected)]. (51) 

According to the conclusion of Theorem [3l we know that, as long as the randomly generated 
graph Q is connected, Pf decays polynomially with N. Moreover, we know from [l3]| that the 
random geometric graph is connected with polynomially decaying probability as long as Cs > P 
Thus, we obtain our claim. ■ 

Remark 5. The proof technique can be generalized easily to other extended random geometric 
graph distributions, if the connectivity assumption is satisfied with high probability. This is 
advantageous over the strict assumptions in |l6l, that nodes are all uniformly distributed . 

A counterpart of Theorem [3] with BECs can also be obtained. 

Corollary 5. Suppose Q = {V,8) satisfies the topology assumption (A.3a) and the channel 
assumption (A.lb). Suppose the parameters of Algorithm [2] satisfy the same conditions as in 
Theorem [3li.e., j^Er{e, R) > |, where Er{e, R) is the random coding error exponent of a BEC 
channel with rate R. Then, using the in-network computing scheme in Algorithm[2l we can obtain 
the identity function at the sink with high probability, and the number of broadcasts scales as 
max{0((igA^), ©(A^loglog A^)}. That is, the error probability Pi^^ = 

Proof: The proof is exactly the same as the proof of Theorem [3l except that the random 
coding error exponent is now of BEC channels instead of BSC channels. ■ 

VIE QC -3 Codes in a Eow-Diameter Graph 

In this section, we provide an in-network computing scheme when the graph diameter is low 
(in particular, when the average multi-hop distance dg is a constant) and the graph topologies 
are random, i.e., specifically, when the graph Q satisfies the topology assumption (A.4) and the 
channel assumption (A. lb). In this in-network computing scheme, the number of broadcasts 
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meets the general lower bound (|2T]P . with the assumption that dg has order 0(1). As noted 
in Seetion ITTT-Ai sinee we are dealing with random graph instanees in this seetion, there are 
two error probabilities assoeiated with an in-network eomputation seheme: the eonditional error 
probability Pf eonditioned on a given graph instanee and the expeeted over the ensemble. 
Note that, there might be eases in whieh the graph instanee is simply not eonneeted and 
is bounded to be one. In what follows, we will use the expeeted error probability P^ over all 
random graph instanees as the evaluation metrie. 

We reeall the assumption (A.4) of extended Erdos-Renyi-type graphs in Seetion ITTT-Ai We 
assume there are N nodes V = The graph Q = {V,£) is obtained as follows: each 

node pair {vi,Vj) G is connected with a directed link with probability pn = where 

c > 0 is a constant. All connections are assumed to be independent of each other. Link {vi,Vj) 
and {vj,Vi) are connected independently as well. Note that we allow self-loops, because each 
node can certainly broadcasts information to itself. Furthermore, there is a unique sink node 
Vq and each node is assumed to have a directed link to it, so that the sink can hear all the 
broadcasted information^ Each link is assumed to be a EEC with erasure probability e. That 
is, if one bit is erased, the receiver knows explicitly the erasure position. 

Note that since each node is connected directly to the sink, there is a naive scheme to achieve 
polynomially decaying error probability with N, i.e., each node transmits the self-information 
bit to the sink for 0(logA^) times. However, this naive scheme can only provide a solution in 
which the number of broadcasts scales as 0(A^log A^). This scheme is also feasible in complete 
graphs, but since it does not achieve the lower bound, even in complete graph settings, a 
more involved scheme was required in |[T|. As shown in |[5l, the data gathering problem in 
a complete noisy broadcast network has a communication complexity lower bound which scales 
as H(A^loglog A^), in order to achieve a constant error probability, even in a complete network 
where each node pair is connected. In what follows, we show that our proposed QC — 3 coding 


^Note that the lower bound ( 121b is for BSCs and the techniques we use here are for BECs. However, even if the algorithm 
in 111 is applied to a complete graph with BECs, the number of broadcasts still scales as log log A^^). Thus, our result is 
still better in that we allow non-complete graph topologies. 

'°This assumption has been discussed in Section IIII-AI In fact, we only require each node to have a bounded distance to 
the sink, which ensures that transmitting one bit to the sink has an erasure probability strictly less than 1 and the number of 
broadcasts required is 0{1). However, for conciseness, we only consider cases when direct links are present. 
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Fig. 6. Each code bit is the parity of all one-hop in-neighbors of a specific node. Some edges might be bi-directional. 


based in-network eomputing seheme aehieves polynomial deeay (in N) of the error probability in 
the above mentioned random graph settings and requires 0(A^ loglog iV) broadeasts. Therefore, 
our broadeasting seheme ean indeed aehieve the broadcasting communication complexity lower 
bound in order sense, and, moreover, in sparser graph settings. 


A. In-network Computing Algorithm 

In this section, an in-network computing algorithm with two steps is provided. During the 
first step, let each node broadcast its self-information bit to its out-neighborhood for t 

times, where 


log( 


c log N > 
Pch 


log(l/e) 

and Pch > 0 is a predetermined constant smaller than 1/2. Then, each node estimates each 
self-information bit from its in-neighbors. The next lemma provides the probability of a certain 
bit being erased when transmitted from a node v to one of its out-neighbors. This lemma is a 
counterpart result of Lemma [T] in BEC. 

Lemma 5. Suppose we have a BEC with erasure probability e. Then, the erasure probability of 
a bit that is repeatedly transmitted for t times on this channel is 

Pch 


Pe = C = 


(53) 


clog 

Proof: The proof follows immediately by substituting in 
After estimating each bit, each calculates the local parity. Suppose node receives the self¬ 
information bits from its in-neighborhood A/'“(Fn) and if all information bits are sent successfully. 


can calculate 


Vn ^ ^ ^ 

Vm&M- (Vn) 


(54) 
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where a„ is the n-th column of the adjacency matrix A, and the summation is in the sense of 
modulo-2. If any bit Xm is not sent successfully, i.e., erased for t times, the local parity cannot 
be calculated. In this case, Un is assumed to take the value ‘e’. We denote the vector of all local 
parity bits by y = [|/i, 1 / 2 ,...,?/at]I f all nodes could successfully receive all information from 
their in-neighborhood, we would have 

yT = x^A, (55) 

where A is the adjacency matrix of the graph Q, and particularly, a random matrix in this section. 

During the second step, each node Vn transmits its self-information bit Xn and the local parity 
Un in its in-neighborhood back to the sink exactly once. Denote the received version of the bit Xn 
at the sink by Xn- Denote the vector of all self-information bits at the sink by x = [xi, X 2 ,..., 

There might be ‘e’s in this vector. Apart from self-information bits, the sink also gets a (possibly 
erased) version of all local parities. We denote all information gathered at the sink by 

r = [xi,..., xtv, ^ 1 , ...^Vn] = [x"^, y"^], (56) 

where [^ 1 , ...,^Af] is the received version (with possible erasures) of all local parity bits y. That 
is, there might be some bits in y changed into value ‘e’ during the second step. If the channels 
were perfect, the received information could be written as 

rT = xT-[I,A], (57) 

which is exactly a channel control code with rate 1/2 and a generator matrix G = [I, A]. 
However, the received version is possibly with erasures, so the sink carries out the Gaussian 
elimination algorithm to recover all information bits, using all non-erased information. If there 
are too many bits erased, leading to more than one possible decoded values x^, the sink claims 
an error. 

In all, the number of broadcasts is 

= N ■t + 2N = N{2+ , ' ) = e(AloglogA), (58) 

log(l/e) 

where t is defined in (l52l) . and the constant 2 is introduced in the second step of the in-network 
computing algorithm, when the self-information bit and the local parity are transmitted directly 
to the sink. 
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Remark 6. Note that in the proposed network-eomputing algorithm, the sink only uses received 
information in the second step for reconstructing the identify function (all data). However, based 
on our assumption, all broadcasts during the first step reach the sink as well. Thus, effectively, 
the sink does not (directly) take into account the bits or erasures received in the first step for 
the purpose of decoding. This indicates that our algorithm could be additionally advantageous in 
application scenarios where inter-sensor broadcasts (broadcasts between among non-sink nodes) 
are cheap, and direct communications between sensors and the sink are expensive, as the number 
of inter-sensor broadcasts required by the in-network computation algorithm is ©(A^loglogiV), 
whereas, the number of direct communications between sensors and the sink is only 2N. 

B. An Upper Bound on the Error Probability 

In this subsection, we analyze the expected error probability of the previous algorithm. As 
defined in Section ITTT-Ai denote by Pf (x) the conditional error probability in gathering all data 
at the sink conditioned on a graph instance Q and self-information bit vector x. The expected 
error probability is defined to be Pe ^\ x .) = E , g [ P ^{ x .)]. In this section, we prove that Pe ^\ x .) 
converges to zero as —)■ oo for all x. 

From Section IVII-Al we know that an error occurs when there exist more than one feasible 
solutions that satisfy the version with possible erasures of (l57l) . That is to say, when all positions 
with erasures are eliminated from the received vector, there are at least two solutions to the 
remaining linear equations. Denote by xi and X 2 two different vectors of self-information bits. 
We say that xi is confused with X 2 if the true vector of self-information bits is xi but X 2 also 
satisfies the possibly erased version of (l57l) . in which case xi is indistinguishable from X 2 . 
Denote by Pf (xi —> X 2 ) the probability that xi is confused with X 2 . 

The Lemma [6] in the following states that P^ (x) is upper bounded by an expression which is 
independent of the argument x (self-information bits). 

Lemma 6. The error probability Pf can be upper-bounded by 

Pf(x)< Pf(x„->0„), (59) 

xTe{o,i}^\{Oiv} 

where Ojy is the A^-dimensional zero vector. 

Proof: See Appendix El ■ 


DRAFT 


Wednesday P* February, 2017 






41 


Each term on the RHS of (l59l) can be interpreted as the probability of the existence of a non¬ 
zero veetor input Xq that is eonfused with the all-zero veetor after all the non-zero entries of 
Xq • [I, A] are erased, in whieh ease Xq is indistinguishable from the all zero ehannel input. For 
example, suppose the eode length is 2N = 6 and the eodeword Xq • [I, A] = [xi, 0, 0, X 5 , xq] 

is sent and the output happens to be = [e, 0, 0, e, e, e]. In this ease, we eannot distinguish 
between the input veetor xj and the all-zero veetor 0^ based on the channel output. 

The Lemma |7] in the following states that the expeeted error of the error event diseussed above 
ean be upper-bounded. This upper bound is obtained by deeomposing the error event into the 
union of three error events on eaeh bit. 

Lemma 1. Define Eq = ( x-xje l)Pch + e, where e is the erasure probability of the BECs and 
Pch is a constant defined in (I52l) . Then, the expeeted error probability Pe^\x) = Eg[P^(x)] ean 
be upper-bounded by 


N 


= Esi/f (x)] < y; 


fc=i 


N 


^0 + (1 — ^ 0 ) 


1 + (1 - 2pY 


N 


(60) 


Proof: We will first show how to deeompose the error event mentioned in the above example 
to obtain an upper bound on the eonditional error probability Pf (x). Then, we show how to 
obtain an upper bound on the expeeted error probability Pi^^(x) = Eg[Pg®(x)]. Finally, we 
compute the expeeted error probability upper bound using random graph theory. 

1) Decomposing the error event conditioned on Q: The ambiguity event mentioned above, 
i.e., a non-zero veetor of self-information bits being eonfused with the all-zero veetor Oat, happens 
if and only if eaeh entry of the reeeived veetor is either zero or ‘e’. When Xq and the graph 
Q are both fixed, different entries in are independent of eaeh other. Thus, the ambiguity 
probability P^i^o —^ Oat) for a fixed non-zero input x|[ and a fixed graph instanee Q is the 
produet of the eorresponding ambiguity probability of eaeh entry in (being a zero or a ‘e’). 

The ambiguity event of eaeh entry may oeeur due to struetural defieieneies in the graph 
topology as well as due to erasures. In partieular, three events eontribute to the error at the 2 -th 
entry of r^: the produet of Xg and the i-th eolumn of [I, A] is zero; the i-th entry of is ‘e’ 
due to erasures in the first step; the i-th entry is ‘e’ due to an erasure in the seeond step. We 
denote these three events respeetively by A^*^(x^), A 2 *^(xq ) and A 3 ^(xq ), where the superseript 
i and the argument Xg mean that the events are for the i-th entry and eonditioned on a fixed 
message veetor Xg . The ambiguity event on the z-th entry is the union of the above three events. 
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Note that the first event is due to structural deficiency, while the second and the third events 
are due to erasures. Therefore, by applying the union bound over all possible inputs, the error 
probability T’f(x) can be upper bounded by 

2N 

xTe{0,i}^\{oJV} i=i 

In this expression, ^ is a random graph. The randomness of Q lies in the random edge connec¬ 
tions. 

2) Decomposing the unconditioned error event: We will further show that 

2N 

F™(x)=E5[Ff(x)]< ^ (62) 

xTe{o,i}^\{oJV} i=i 

We use a set of random binary indicators {Emn}m,n=i to denote these edges, i.e., Emn = 1 
if there is a directed edge from node Vm to Vn- Note that we allow self-loops, because each 
node can certainly broadcasts information to itself. By Assumption (A.4), all random variables 
in {Emn}Z,n=i ^0 mutually independent. Since in the in-network computing algorithm, the self¬ 
information bit Xi and the local parity bit yi is only calculated based on the in-edges of Vi, i.e., 
the edge set = {Eni\l < n < N}, we obtain 


Pr[AW(x;[) U ) U )|^] = Pr[Ai*^(X(j) U ) U A'i’{^^)\E^i, 1 < n < N]. 












Thus 


Yl Pr [aP (x;[) U (x;[) U A^*^ (x^^) I g] 


i=l 

2N 


= nPr[Af^(xo) U A^*^(x^) U Aj^(x^)|E„i,l < n < A^]. 


(63) 


Note a bidirectional edge in the current setting corresponds to two independently generated 
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< ^ Eg 


27V 


JJ Pr[A^*^ (X([ ) U (x;[) U 4*^ (4) 1^] 


2=1 


(a) 


2N 


Y1 n [Pr[^i (x^) U (x;[) U {xo)\Eni,l <n< N] 
xTe{o,i}^\{o^} *=i 
27V 

Yi n ^^ ^2 ^ ^^0) u ( 4 )]> 

xTe{o,i}^\{o^} *=i 


(64) 


where the equality (a) follows from the fact that the sets {T^ni}i<n< 7 v and {i?ni}i<n< 7 v are 
independent (by the link generation hypothesis) for any pair {i,j) with i ^ j. 

3) Computing the expected error upper bound using random graph theory: 

Lemma 8. Define k as the number of ones in Xg and Eq = {-[zyjiEC)Pc\i+^, where e is the erasure 
probability of the BECs and Pch is a constant defined in (15^ . Further suppose clogiV > 1. Then, 
for 1 < i < iV, it holds that 

N 

JJPr[4^(x4 u4^(x4 u4^(x4] = (65) 

2=1 

For N + 1 < i < 2N, it holds that 


Pr[/lf^(x^ ) U 4^(xcJ) U 4*^(x^ )] < £o + (1 - £o) 


16 )/ 


W, 


1 + (1 - 2py 


( 66 ) 


where p is the connection probability defined in Assumption (A.4). 

Proof: See Appendix O ■ 

Based on Femma [8] and simple counting arguments, note that (l62l) may be bounded as 

pA(x) < f (: 

k=l ^ 

where the binomial expression is from the fact that there are (^) codewords xq with k ones. 
Thus, we conclude the proof. ■ 

By respectively analyzing the upper bound in Femma |7] for k = o and k = fl 

we obtain the final error bound as follows. 


^0 + (1 ~ ^o) • 


1 + (1 - 2p) 


kl 


TV 


(67) 
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Theorem 4. Suppose the graph Q satisfies the topology assumption (A.4) and the ehannel 
assumption (A.lb). Suppose <5 > 0 is a eonstant, Pch £ (0) |) is a eonstant, e is the ehannel 
erasure probability and Sq = {-[ztjl + ^)Pch + e- Assume clog> 1. Define 

1 1 _ e-2c<5 

bs = -{1 - eo){l -^-), ( 68 ) 

and assume 

e <bs. (69) 


Then, for the transmission seheme in Seetion IVII-Al we have 

( /Y2-c(l-eo)(l-c(5) 

< I (1 - bsf+6ee^ 


(70) 


log 

That is to say, if 2 < c(l — £o)(l — c5), the error probability eventually deereases polynomially 
with N. The rate of deerease ean be maximized over all 5 that satisfies 


Proof: See Appendix |Dl ■ 

Remark 7. The QC-3 eode is “eapaeity aehieving” in some sense, in that this eode has rate and 
this eode ean be used even when the erasure probability e ~ Consider the ease when e = A, 
where A is a small eonstant. In Theorem 4, ehoose 5 = ^ and Pch = ^—tt- In this ease, the 

2c -Ccti + 

eonstants in Theorem 4 satisfy Sq = e + A = | —A, and 2bs > (1 — £o)(l “C(5) > l — SQ — cb = 
Then, the error probability upper bound in Theorem 4 ean be simplified to 


/^-^<(l-(^-(^-A)))-+^(l 


A) 


^2-c(i + f){l-f) 

log A^ 


<(1-A) 


N 


eA A^ 2 -c(|+f) 


(71) 


4c log N 

which decays polynomially with N for all small A > 0 and c > 4. 

However, consider using QC-2 code in a complete graph with BEC channels. From Corol¬ 
lary [51 the error probability of QC-2 code in a complete graph can be shown to be = 
( 9 (Ar- 2 pE,.(e,i)+|), Pr(e, I) is the random coding exponent for a BEC with erasure 

probability e and code rate I. In the case that e —>■ I, i.e., the capacity achieving limit, Pr(e, |) 
vanishes, and hence the QC-2 code requires a much denser network (it requires p > 
than the QC-3 code (it only requires c > 4). 

Interestingly, the result of Theorem |4| implies a more fundamental result for erasure codes. 
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Corollary 6. For a discrete memoryless point-to-point EEC with erasure probability e, there 
exists a systematie linear eode with rate- 1/2 and an iV x 2N generator matrix G = [I, A] sueh 
that the bloek error probability deereases polynomially with N. Moreover, the generator matrix 
is sparse: the number of ones in A is 0{N\ogN). 

Proof: The proof relies on building the relation between the QC-3 graph eode and an ordinary 
error control code. We eonstruct the error control code as follows: 

• Construet a direeted Erdos-Renyi network Q = (V,£^) with N nodes and eonneetion 

probability p = where c is a eonstant which will be defined later. 

• Construet a linear code with the generated matrix G = [I, A], where is the adjaceney 

matrix of the direeted network in the previous step, i.e., the entry Am,n = 1 if and only if 
Vm is eonneeted to 

The number of edges in is a binomial random variable distributed aeeording to Binomial(A^, p). 
Using the Chernoff bound [l431l . we obtain 

Pr(|^| > 2pN'^) < exp(-^A2) = (72) 

Then we use the eode construeted above to encode N binary bits and transmit the encoded bits 
via 2N parallel BECs to the reeeiver. Denote by the event of a bloek error on the reeeiver 
side. Define = Pr(Ai^^) as the bloek error probability. Note that 

pW = E [pS] ^ (73) 

where Pf = Pr I is the bloek error probability conditioned on the graph instance Q. 

In other words, Pi^^ is the expeeted bloek error probability of an ensemble of eodes eonstrueted 
based on direeted Erdos-Renyi networks. 

Clearly, this point-to-point transmitting seheme is the same as carrying out the in-network 
eomputing algorithm in Seetion IVII-Al exeept that the eneoding step in the point-to-point ease 
is eentralized instead of being distributed. This is equivalent to the in-network eomputing seheme 
when channels between neighboring sensor nodes are without erasures and erasures happen only 
when eommunieating over the ehannels to the deeoder (eompare with the seeond step of the in- 
network eomputing algorithm). Sinee erasure events constitute a striet subset of those eneountered 
in the in-network computing scheme, the upper bound on the error probability in Theorem |4] still 
holds, whieh means that the expeeted bloek error probability Pi^^ goes down polynomially when 
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the constant c designed for the connection probability p = satisfies the same condition in 
Theorem IH Note that 


piN) ^ Pr(4^)) = Pr(|£| > 2pN^) Pr \ \8\ > 2pN^) 

+ Pr(|£| < 2pN^) Pr {Af'> \ \S\ < 2pN^) . 


(74) 


Thus, combining (1741) with (1721) and (iTOl) . we conclude that the block error probability conditioned 
on \£\ < 2pN‘^, or equivalently Pr(y4i^^||£^| < 2pN^), decreases polynomially with N. This 
means that, by expurgating the code ensemble and eliminating the codes that have more than 
2pN'^ = 0{N log N) ones in their generator matrices, we obtain a sparse code ensemble, of 
which the expected error probability decreases polynomially with N. Therefore, there exists a 
series of sparse codes which obtains polynomially decaying error probability with N. ■ 


Remark 8. In fact, the QC-2 code also satisfies all required properties in this theorem. This fact 
is mentioned implicitly in ll29l . Therefore, the ^C-3 code can serve as another instance of sparse 
codes that satisfy these properties. 


We simulate the QC-T) code with different code lengths in an extended Erdos-Renyi network. 
The ratio of successful identity function computing at the sink node is compared with the number 
of broadcasts during the entire in-network function computing scheme (see Section IVII-AI for 
details), including t in-network broadcasts in the first phase and 2 transmissions to the sink node 
in the second phase. We can see from the simulation result that the number of broadcasts at each 
node required for successful identity function computing almost does not change for different 
network size. This is because the required number of broadcasts is (P(loglogiV) at each node, 
and hence it increases very slowly with the code length or the number of nodes in the network. 


C. The Degree Lower Bound for the QC-3 Graph Code 

In this part, we prove that p = 0(^^^) is the minimum connection probability that gives the 
polynomial decay of error probability in Theorem |4l In fact, we will prove a worst-case result 
for the total number of edges in the computation graph Q: the number of edges in the network 
must be ^( i^g\og^ )- This result suggests that, despite a negligible ratio the connection 

probability p = ig optimal in terms of sparseness. Since the worst-case result is for a fixed 
graph, we require the connectivity assumption (A.2). 
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Theorem 5. Suppose the ehannel assumption (A. lb) holds. Suppose the algorithm in Section IVII-AI 
is carried out. Then, if lim = 0, it holds that 

N^oo 




Ariog(Ar/pW), 


log log N 

where \S\ denotes the number of all directed edges in the edge set £. 


(75) 


Proof: During the first step of the algorithm in Section IVII-Al each self-information bit is 
broadcasted for t times. Therefore, for a node the total number of possibly erased versions 

N 

of Xn is dnt where dn = (!;,„)}• Each directed edge is counted once, so we have 


Y,dn=\£\- (76) 

n=l 

During the second step of the algorithm, each self-information bit Xn is transmitted to the sink 
once. For any Xn, the probability that all dnt + 1 copies of Xn are erased is 


Pn = e 




(77) 


If this event happens for any Xn, the identity function cannot be computed reliably, because at 
least all possible information about Xn has been erased. Thus, we have 

N 

> I (78) 

n=l 
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Fig. 1. Network transformations that relate coding theory to noisy broadcast networks. 


Based on 1 — x < exp(—x) and the fact that arithmetic mean is no less than geometric mean, 
we have 


N 


1 _ pW < p[(i - p„) < 


n=l 


1 


N 


N 


N 


N 


]v5Z(i-P" 

n=l 

N \ ^ / JV 

W E dnt+1 \ ( W 12 dnt+1 

< I 1 — e "=i 1 < exp —N ■ e '^=1 


n=l 


(79) 


which can be translated into 


N 


(tdn + 1) > iV 


n=l 


log N - log log(l/(1 - 
log(l/e) 


(80) 


When lim = 0, it holds that — loglog(l/(l — Pi^^)) = 0(log—^). Therefore, jointly 

TV^OO Pe 

considering (l52l) . we get 

(8.) 

71=1 


Tog(clogiV/pch) 


Remark 9. Note that the lower bound (1751) holds for individual graph instances with arbitrary 
graph topologies, instead of holding for certain ensemble average. 

Similar with Theorem |4] and Corollary 0 Theorem [5] also implies a result in point-to-point 
coding theory, but the proof is not obtained by directly applying Theorem [5l We have to carry 
out a series of network transforms, as shown in Fig. |7l 

Corollary 1. For a rate-1/2 linear block code with an iV x 2N generator matrix G = [I, A], if 
there are dn ones in the n-th column of A, then, the code is asymptotically good for a point- 
to-point discrete memoryless BEC with erasure probability e, i.e., the block error probability 
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lim Pe =0, only if 

Af^.oo 

N 

Y,dnlogdn = n{Nlog{N/P^^^)). (82) 

n=l 

Proof: Suppose we have a eode G = [I, A] that satisfies the eonditions in this eorollary. 
As shown in Fig|7]^a), eonstruet a direeted graph Q = (V,£^) with the following procedures 

. Set |V| = A; 

• Connect a directed edge from the node Vm to the node Vn if Am,n = 1, where m can be 
equal to n, in which case a directed self loop is constructed; 

• Assume each edge is a noiseless channel. 

After constructing the graph, construct an extra node Vq to be the sink, and connect each node 
to the sink. The links to the sink are all assumed to be discrete memoryless BECs with identical 
erasure probability e. Suppose in the network constructed above, each node Vn ^ V carries a 
self-information bit Xn- Then, we can use the in-network computing algorithm in Section IVII-AI 
to gather all sensor measurements at the sink vq. Clearly, what the algorithm does is encoding the 
information vector x with the generator matrix G = [I, A] (see (1571) 1 and sending the encoded 
message through 2N parallel BECs to the sink. Until now, the inter-sensor edges in £ are all 
noiseless. The only noisy edges are from sensors to the sink, which means in the first step of the 
in-network computing algorithm, instead of broadcasting each self-information bit for t times (as 
defined in (I52l) l. each node only needs to broadcast once. Therefore, the in-network gathering 
of all data in the constructed network is equivalent to the encode-and-decode procedure with the 
block code G = [I, A] on a point-to-point link, and hence they have the same error probability 

Now, modify the constructed network by assuming that links from all sensor nodes to the 
sink are noiseless when transmitting the parity bits. That is, in the second step of the in-network 
computing algorithm, these sensor-to-sink links are only noisy when self-information bits are 
transmitted. However, assume that the links between sensors are noisy, as shown in Fig |7];b). 
Specifically, for each node assume that all the directed links from the in-neighborhood A/^“ 
are changed into BECs with identical erasure probability e/dn, where dn = \Af~{vn)\- Now that 
the local parity that Vn sends to the sink is erased with probability 1 — (1 — < e, therefore, 

if the original network can gather all data with error probability Pe^\ the transformed network 
can compute it with error probability strictly less than Pe^K 
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Now make a further ehange as shown in Fig ITJ^c), which consists of substituting each sensor- 
to-sensor link with erasure probability e/dn to a set of [1 + parallel links with erasure 

probability e connected to a merging gate. This gate claims an ‘erasure’ only if all bits in 
the incoming edges are erased. This transform is exactly the same as repeatedly transmitting t 


times of the same bit as defined in (15^ . After this transform, the erasure probability changes 


to < e/dn- Similarly, if the original network can reliably gather all data with error 

probability Pe^\ the new network can also compute it with lower error probability. 

Therefore, if the block code G = [I, A] can be used to successfully transmit all bits on a 
point-to-point EEC with error probability Pe^\ data gathering in the transformed network shown 
in Fig |7];c) can be reliably completed with lower error probability. By Theorem |5l to achieve 
error probability Pe^\ the degree of the transformed network should satisfy 


log d' 


N 


+>iv- -\oghg{l/{I - P^^^)) 


n=l 


log(l/e) 


log(l/e) 


(83) 


This implies that (1^ holds. ■ 

This corollary suggests that, if one wants to find a sparse linear block code for BECs, then (f8^ 
can serve as a lower bound on ‘sparseness’. Moreover, if the matrix A has the same number 
of ones in each column, then, there are (l( iogio^Af ) in each column, in order for (f82l) to 
hold. A similar result was obtained in ||2^ . which states that max dn is at least f2(logn), in 

l<n<N 

order to achieve an error probability strictly less than 1. However, this result is obtained only 
for the maximum value dn, which does not capture the total sparseness. Moreover, our result is 
in coding theory but relates to distributed encoding as well. 


VIII. Conclusions 

In this paper, we obtain both upper and lower scaling bounds on the communication complexity 
of data gathering in arbitrary noisy broadcast networks. In particular, using different graph-based 
distributed encoding schemes, which we call graph codes, we find two special graph topologies, 
i.e., geometric graphs and extended random Erdos-Renyi graphs, in which the upper bounds on 
the number of broadcasts obtained by graph codes meet with the general lower bound in order 
sense. Furthermore, the analysis techniques of the third graph code is used to construct a sparse 
erasure code that is used in point-to-point communications. We also use cut-set techniques to 
show that the obtained code is almost optimal in terms of sparseness (with minimum number 
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of ones in the generator matrix) exeept for a log log multiple gap, sueh that the bloek error 
probability approaehes zero in the limit of large eode length N. However, quite a few open 
questions worthy of further researeh remain. For instanee, an issue with the QC-3 eode proposed 
in this paper is that it ean be analyzed only in EEC networks. The teehnieal diffieulty is that 
the “effeetive ehannel noise” is determined by the graph strueture and henee is dependent of the 
eode strueture itself. The analysis of QC-3 eode in EEC networks as aehieved in this paper is 
feasible beeause the upper bound on error probability ean be deeomposed as the produet of the 
error probability of eaeh partieular bit. However, this deeomposition eannot be readily obtained 
for ESC ehannels (and networks). A meaningful direetion is to understand and eharaeterize this 
effeetive ehannel noise for analyzing QC-3 eodes in ESCs. The foeus of this paper has been 
primarily on the design of eodes that minimize the broadeast eomplexity, i.e., the number of 
broadeasts required to aehieve funetion eomputation. Other praetieal metries sueh as the energy 
of broadeast (whieh, depending on the network strueture, is somewhat indireetly related to the 
number of broadeasts) may be of interest in applieations too. An extension of QC-3 eodes from 
an energy minimization perspeetive is provided in a follow up eonferenee paper ll46l . 

Appendix A 

Proof of ([HI) in Theorem [2] 

Sinee the eode length at eaeh node w G Sr is 7 log N, aeeording to Lemma [2l the deeoding 
error probability is 

Pe,^ < exp[-(7logAr + l)P,(e,P)/P] = exp[-P,(e, (84) 

Similarly, the deeoding error probability at a node v G Ar is 

Pe,. <exp[-(P, + l)P,(e, R)/R] < exp[-^ logiVP,(e, P)] < (85) 

it 

where we used the faet that the message size in v is greater than or equal to 7 log N, and 
henee we ean find a eode with length [(P^, + 1)/P] > ^ log A^. 

Combining (l84l) and (l85l) and using the union bound, the error probability is bounded as 
follows 

P]^) < Pe,^ < iV • + N ■ exp[-P,(e, P)/P]Ar-^®’-(^’-^)/^ 

veAr v&Br ( 86 ) 

. (1 + exp[-P^(e, R)/R ]). 
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When the eondition R < jErie, R) is satisfied, the error probability in ([8^ satisfies the property 
that liniTv^oo = 0 and the eonvergenee rate is polynomial. This eoneludes the proof. 


Appendix B 
Proof of Lemma 

We know from the union bound that 

Pe{^)< (87) 

x7e{0,ip\{x} 

Lemma 9. The probability that xi is eonfused with X 2 equals the probability that xi — X 2 is 
eonfused with the A^-dimensional zero veetor Oat, i.e., 

pf (xi -A X 2 ) = pf (xi - X 2 Oat). (88) 

Proof: We define an erasure matrix E as a 2A^-by-2A^ diagonal matrix in whieh each 
diagonal entry is either an ‘e’ or a 1. Define an extended binary multiplication operation with 
‘e’, which has the rule that ae = e,a G {0,1}. The intuition is that both 0 and 1 become an 
erasure after being erased. Under this definition, the event that xi is confused with X 2 can be 
written as 

x7-[I,A]-E = xJ-[I,A]-E, (89) 

where a diagonal entry in E being ‘e’ corresponds to erasure/removal of the corresponding linear 
equation. We know that if the erasure matrix E remains the same, we can arrange the two terms 
and write 

(x7-xJ)-[I,A]-E = 0T •[I,A]-E. (90) 

That is to say, if xi is confused with X 2 , then, if all the erasure events are the same and the 

self-information bits are changed to xi — X 2 , they will be confused with the all zero vector 
Oat and vice-versa. Thus, in order to prove (l88l) . we only need to show that the probability of 
having particular erasure events remains the same with different self-information bits. This claim 
is satisfied, because by the BEC assumption the erasure events are independent of the channel 
inputs and identically distributed. ■ 

Thus, using the result from Lemma [9l we obtain 

Pe®(x)< ^ Pf(x-Xi ^ Ojv), (91) 

xTe{0,i}^\{x} 

and hence, (l59l) holds. 
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Appendix C 
Proof of Lemma [8] 


First, we notice that for 1 < i < iV, the vector received is the noisy version of Xq . Since, 
according to the in-network computing algorithm in Section IVII-A[ the vector x^ is obtained in 
the second step, the event Ag ^ (xq ) is the only ambiguity event. Moreover, if the i-th entry of Xq 
is zero, it does not matter whether an erasure happens to this entry. Thus, the error probability 
can be calculated by considering all the k non-zero entries, which means 

N 

n Pr|A®(xj) U Af (xj) U (xj)] = 6 *. 

For -f 1 < i < 2N, A 3 ^(xq ) is the erasure event during the second step and is independent 
from the previous two events A^*^(xq ) and A 2 ^(xq ). Therefore 


Pr 
< Pr 


=1 — e -f e ( Pr 


|— o 


U AJ^(x^) 


1 - 1 

“3 


+ Pr 

1 1 

“3 

1 _ 1 

Pr 

e -f ePr 

'a^ 

(X([) U A^*^(x;[) 








(92) 




-I- Pr 






The event Ag*^(xQ) happens when the local parity Xq a* equals zero, i.e., in the k locations of 
non-zero entries in Xg, there are an even number of ones in the corresponding entries in a*, 
the i-th column of the graph adjacency matrix A. Denote by I the number of ones in these k 
corresponding entries in a*. Since each entry of a* takes value 1 independently with probability 
p, the probability that an even number of entries are 1 in these k locations is 

Pr[AS*^(xo )] =Pr[/is even] = p\l - pf-^ ^ ^ . ( 93 ) 

I is even 


The event (Ag*^(xQ ))*" n A 2 ^(xq ) indicates that I is odd and at least one entry of all non-zero 
entries in Xg is erased. Suppose in the remaining N — k entries in a*, j entries take the value 
1 and hence there are (/ -I-j) I’s in cq. Therefore, for a fixed I, we have 


Pr[(/l<->(xJ))‘=n4)(x3')|i] [1 - (1 -p,)'N 

j=0 V J / 

j +i)Pe, 
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where p is the edge eonnection probability and Pe is the probability that a certain bit in xq is 

log(£i 2 &^) 

erased for t = times when transmitted to Vi from one of its neighbors during the first 

step of the algorithm. Combining the above inequality with Lemma [51 we get 


N-k 


Pr[(A«)^nA«( 0 ]< 5 ^ 

i=o 

N-k 


N-k\ _.N-k-jn . Pch 


3 J c log iV 


=l 


Pch 


clogiV 


i=o 


N-k 


3 


N-k 




clogiV-J^ V 3 


N-k 


p3{l -p) 


N-k-j 


{a)j^ Pch PchP 


N-k 


clogiV clogA^ 


y](iv-i) 


N-k-1 
3 - 1 


-Li _ 


p3 (l-p) 


=l 


Pch , Pch{N -k)^fN-k-l^ 


clogiV 


+ 


_; Pch I 

—I— -7T + Pch • 


N 

N-k 


N-k 

E 

J =1 


J - 1 


Jp (l-p) 


N-k-j 


clogiV ' N ’ 

where step (a) follows from = {N — Therefore 


n Af] 


I is odd 


E (i)p'(l-p)‘"'Prl(4‘EnA®( 


I is odd 


< E , p'(i-p)‘''(' 


k-ln P^^ 


1 


J 

I is odd 

N-k 


clogiV 

k-L^ N — k 
N 


+ Pch 


N-k, 

N ' 


=Pch ■ 


N 


I is odd 


I is odd 

k-l , 


l[ ]p\l-p) 


k—l Pch 


E / ^ 1 Li A-i I '^yych 

I p(i-p) +m;;7a7E 


c log N ^ \ ^ — 1 

° l is odd ' 


k-l 


clogiV 

p^-\l-p) 


k-l 


=Pch ■ 


N — kl — {1 — 2pY 


N 


+ Pch ■ 


k l + {l-2p) 
N 2 


k-l 


(a) 1 - (1 - 2 p)^ 

< Lpch-^-, 


where the constant L in step (a) is to be determined. Now we show that L = + 1 suffices 

to ensure that (a) holds. In fact, we only need to prove 

- fc 1 - (1 - 2pf fc 1 + (1 - 1 - (1 - 2pf 

-r- V -rz -;- L- 


N 


N 
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Since < 1, it suffices to show that 


N 




N 


Since (1 — 2p)^ ^ < 1, it suffices to show that 


N - ^ ^ 2 


or equivalently, 


We know that 


2k 


1 - (1 - 2py 


< iV(L-l). 


(94) 


1 — (1 — 2p)^ > 2kp — Cl{2py = 2kp — 2k{k — l)p^ = 2kp [1 — p{k — 1)] > 2kp{l — kp). 


Thus, when kp < \, 1 — {1 — 2p)^ > 2kp{l — kp) > kp and 


2k 


2k 2N 




l-{l-2p) kp c\ogN 


< 2N, 


when clogiV > 1. When kp > 4, (1 — 2p)^ < (1 — 2p)'^p < 4 and 


2k 


2k 2N 

< - < 


l-(l-2 p)'^ - 1-1/e - 1-1/e' 

Thus, as long as L > 1 + (l94l) holds. Jointly considering (|9^ . we get 

P,I^<-) u 4'>] < 


Combining (l92l) . we finally arrive at 

Pr[AP U U A^'^] <e + (1 - e) 


i±(l^ + 


— e + (1 - e) 


1 - (1 - Lpch) 


1 - (1 - 2py 


= 1 - (1 - e) (1 - Lpch) 


1 - (1 - 2py 


< 1 - (1 - e - Lpch) 


1 - (1 - 2py 


= 1 - (1 - e - Lpch) 


1 - 


1 + (1 - 2py 


= e + Lpch + (1 - e - Lpch) 


1 + (1 - 2py 


— eo + (1 ~ ^o) 


1 + (1 - 2py 
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where Eq = Lp^h + e. 


Appendix D 
Proof of Theorem [4] 


We will prove that for any 5 > 0, it holds that 

p{N) < (1 _ bsf+6ee 




(95) 


logiV 

As shown in what follows, we bound the right hand side of (l60l) with two different methods for 
different /c’s. First, when k satisfies 

N 


1 < k < 6 


define 


Then, based on the inequality 


we have 


U = A^(l - £o) 


log A^’ 

1 - (1 - 2p)^ 


(1 - y<e ,Va:e(0,1], 

X 


I /'I , ^ ^ 5” lAf /'I ^ \N \i-\ ^ \—^u ^ 

N + (l-eo)-^-J 


From the Taylor’s expansion, we get 

(1 - 2pf = l-2pk + ~ 


e\ee[0,2p]. 


By applying the equation above to (1971) . we get 

u = N{1 — eo)[kp 

Therefore, we have 




r]. 


\ ck{l-£o) 2 

N 


(96) 


(97) 


(98) 


(99) 
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Plugging the above inequality into (1991) . we get 


( + (1 ~ ^o) 


1 + (1 - 2pf 






< ^ e" — 


\k J \N 


-c(1-£o)(1-C(5) 


-k 

where the last inequality follows from 
Seeond, when k satisfies 


k>5 


N 


logiV’ 


we ean direetly write 


(1 - 2pf = [(1 - 


Therefore, it holds that 


fc>^I3iTv 




- + 


1 + e 


-2c5 


iN 


-,—2cS 


<|£„ + (1-T„)i±|—]" 


N 

E 

k=0 


N' 


^—2c5 


— [^0 + (1 ~ ^ o )-;;- ]^(1 + c ) 


N 


= [(l-(l-^o) —" 
<{l-[(l-£o)(l- 
={l - {^bs - e)Y. 

When (1691) holds, we have 


-2cS 


-)(1 + ^)] 


N 


1 — e 


-2c5 


e]} 


N 


E 


k>S 


N 

log N 






k J c log N' 


N 


( 100 ) 


( 101 ) 


( 102 ) 
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Combining (1^ and (llOOl) . we get 


Pi"’< (1 - 6.)"+ E + 

1 N V / 


N 








^<5l8iTv 


< (1 - bsf + 

logiV k 


< (1 - bs)^+5ee 


^2-c(1-£o)(1-c<5) 

log 


When 2 < c(l — eo)(l ~ c5), the right hand side decreases polynomially with N. 
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